From marc at ena.com Thu Sep 1 00:14:03 2005 From: marc at ena.com (Marc Powell) Date: Wed, 31 Aug 2005 17:14:03 -0500 Subject: 404 Not Found Message-ID: > -----Original Message----- > From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users- > admin at lists.sourceforge.net] On Behalf Of Lee Ball > Sent: Wednesday, August 31, 2005 3:20 PM > To: Nagios-users at lists.sourceforge.net > Subject: Re: [Nagios-users] 404 Not Found > > I found that although the manual says you can get to it at /nagios/ its > actually /nagios instead. Try that. It depends on how you specified the Apache Alias. If it's /nagios/ then you must have the trailing slash for it to match. If it's /nagios then you don't. [snip] > Marc Powell wrote: > > Please try to respond in context. It makes it difficult for someone > > reading this thread in the future to follow exactly what's going on. So much for that.... -- Marc ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From lee at effective-it.co.uk Thu Sep 1 00:53:22 2005 From: lee at effective-it.co.uk (Lee Ball) Date: Wed, 31 Aug 2005 23:53:22 +0100 Subject: 404 Not Found In-Reply-To: References: Message-ID: <431634E2.40806@effective-it.co.uk> I have to disagree here, I find it more awkward to have to scroll down through what has already been written to view the new bits, especially when sometimes the replies don't go in the >> format properly. I know to change the config of apache to /nagios/ (only after realising the manual conflicted with itself). But if someone follows it word for word then they might have an issue. Marc Powell wrote: > >>-----Original Message----- >>From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users- >>admin at lists.sourceforge.net] On Behalf Of Lee Ball >>Sent: Wednesday, August 31, 2005 3:20 PM >>To: Nagios-users at lists.sourceforge.net >>Subject: Re: [Nagios-users] 404 Not Found >> >>I found that although the manual says you can get to it at /nagios/ > > its > >>actually /nagios instead. Try that. > > > It depends on how you specified the Apache Alias. If it's /nagios/ then > you must have the trailing slash for it to match. If it's /nagios then > you don't. > > [snip] > > >>Marc Powell wrote: >> >>>Please try to respond in context. It makes it difficult for someone >>>reading this thread in the future to follow exactly what's going on. > > > So much for that.... > > -- > Marc > > > ------------------------------------------------------- > SF.Net email is Sponsored by the Better Software Conference & EXPO > September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices > Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA > Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nagios at jehster.net Thu Sep 1 03:24:51 2005 From: nagios at jehster.net (Roy Kidder) Date: Wed, 31 Aug 2005 21:24:51 -0400 (EDT) Subject: Check remote MS service status In-Reply-To: <33555.206.131.211.142.1125419676.squirrel@206.131.211.142> References: <33555.206.131.211.142.1125419676.squirrel@206.131.211.142> Message-ID: <47326.192.168.1.105.1125537891.squirrel@192.168.1.105> > My main interest is checking if a remote service is running (one that is > not network-reachable). Anyone ever heard of a perl script I can run on my > Linux box to ask the NT box via RPC if a service is running? Or is nrpe > the way to go? Thanks for all the responses I got to this question. I'll give the different suggestions a try and see which works best for me. Roy ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From elizar.palad at gmail.com Thu Sep 1 04:49:50 2005 From: elizar.palad at gmail.com (Elizar M. Palad) Date: Thu, 1 Sep 2005 10:49:50 +0800 Subject: @sysconfdir@/cgi.cfg - Installation setup problem In-Reply-To: References: Message-ID: sorry, should have checked the cfg file first.. :) set the use_authentication to 0.. the page displays.. but warning No Output is displayed in Status Infromation..? On 9/1/05, Elizar M. Palad wrote: > > Hi! Ok, i rerun configure script after installed perl, seems ok. no error > and all > the .h files are created and there's no more error in make install-config > (i had one before) > i am using the minimal.cfg to test nagios. > I started nagios without any options. > /usr/local/nagios/bin/nagios /usr/local/nagios/etc/nagios.cfg > and it says: > sh: @libexecdir@/check_ping: not found > sh: @libexecdir@/check_users: not found > sh: @libexecdir@/check_procs: not found > Is this ok? > The IE page now has an error... > It appears as though you do not have permission to view information for > any of the hosts you requested... > > If you believe this is an error, check the HTTP server authentication > requirements for accessing this CGI > and check the authorization options in your CGI configuration file. > > I didn't set up any authentication for nagios. Maybe it caused the error > above? > if it does, how to use nagios without authentication? > thanks! > > > On 8/31/05, Subhendu Ghosh wrote: > > > > > > Yes - autoconf which creates configure uses perl :) > > > > You can either compile your own or sunfreeware.comhas some nice pkgadd > > versions. > > > > -sg > > > > > > On Wed, 31 Aug 2005, Elizar M. Palad wrote: > > > > > Hi, thanks! > > > from ./configure's output: > > > > > > creating html/index.html > > > creating html/side.html > > > creating include/config.h > > > creating include/snprintf.h > > > creating include/nagios.h > > > creating include/cgiutils.h > > > ./configure: perl: not found > > > ./configure: perl: not found > > > > > > Creating sample config files in sample-config/ ... > > > ./configure: perl: not found > > > ./configure: perl: not found > > > ./configure: perl: not found > > > ./configure: perl: not found > > > ./configure: perl: not found > > > ./configure: perl: not found > > > ./configure: perl: not found > > > ./configure: perl: not found > > > > > > > > > *** Configuration summary for nagios 2.0b4 08-02-2005 ***: > > > > > > General Options: > > > ------------------------- > > > Nagios executable: nagios > > > Nagios user/group: nagios,nagios > > > Command user/group: nagios,nagios > > > Embedded Perl: no > > > Event Broker: yes > > > Install ${prefix}: /usr/local/nagios > > > Lock file: ${prefix}/var/nagios.lock > > > Init directory: /etc/init.d > > > Host OS: solaris2.6 > > > .... > > > Im going to install now perl and see if it will create the location.hfile. > > > Thanks! > > > eli > > > > > > On 8/31/05, Subhendu Ghosh wrote: > > >> > > >> On Wed, 31 Aug 2005, Elizar M. Palad wrote: > > >> > > >>> I rerun ./configure --prefix=/usr/local/nagios > > >> --with-cgiurl=/nagios/cgi-bin > > >>> --with-htmlurl=/nagios --with-nagios-user=nagios > > >> --with-nagios-groupp=nagios > > >>> --with-gd-lib=/usr/local/lib > > >>> and summary is below: > > >>> *** Configuration summary for nagios 2.0b4 08-02-2005 ***: > > >>> > > >>> General Options: > > >>> ------------------------- > > >>> Nagios executable: nagios > > >>> Nagios user/group: nagios,nagios > > >>> Command user/group: nagios,nagios > > >>> Embedded Perl: no > > >>> Event Broker: yes > > >>> Install ${prefix}: /usr/local/nagios > > >>> Lock file: ${prefix}/var/nagios.lock > > >>> Init directory: /etc/init.d > > >>> Host OS: solaris2.6 > > >>> > > >>> Web Interface Options: > > >>> ------------------------ > > >>> HTML URL: http://localhost/nagios/ > > >>> CGI URL: http://localhost/nagios/cgi-bin/ > > >>> Traceroute (used by WAP): > > >>> > > >>> > > >>> Review the options above for accuracy. If they look okay, > > >>> type 'make all' to compile the main program and CGIs. > > >>> ------------------ > > >>> Looks ok isn't it? > > >>> then when i do 'make all', i have this error: > > >>> # make all > > >>> cd ./base && make > > >>> make: Fatal error: Don't know how to make target > > >> `../include/locations.h' > > >>> Current working directory /packages/nagios-2.0b4/base > > >>> *** Error code 1 > > >>> make: Fatal error: Command failed for target `all' > > >>> This is also my second time with this error. What I did was i copy > > >>> locations.h.in < http://locations.h.in> < > > http://locations.h.in> as > > >> locations.h :-) > > >>> and the compilation continues.. (was that not right? :) > > >>> thanks! > > >> > > >> > > >> NOOOOO - configure must have produced an error before the summary was > > >> printed. > > >> > > >> configure should create a locations.h from locations.h.in > > after > > >> appropriate substitutions. > > >> > > >> If the file is not being created properly - you may have a file > > permission > > >> issue. > > >> > > >> run "chown -R user:group nagios_src_dir/" > > >> > > >> replace the user and group with the user and group info fort he > > person > > >> running configure > > >> > > >> replace nagios_src_dir with the dir that contains the nagios src. > > >> > > >> Never copy a *.*.in file into a *.* file. > > >> > > >> > > >> > > >> -- > > >> > > >> -sg > > >> > > > > > > > > > > > > > > > > -- > > > > > > > -- > ---- > Don't Tell Me How Hard You Work.. > Show Me How Much You'd Accomplished.. > -- ---- Don't Tell Me How Hard You Work.. Show Me How Much You'd Accomplished.. -------------- next part -------------- An HTML attachment was scrubbed... URL: From elizar.palad at gmail.com Thu Sep 1 04:45:46 2005 From: elizar.palad at gmail.com (Elizar M. Palad) Date: Thu, 1 Sep 2005 10:45:46 +0800 Subject: @sysconfdir@/cgi.cfg - Installation setup problem In-Reply-To: References: Message-ID: Hi! Ok, i rerun configure script after installed perl, seems ok. no error and all the .h files are created and there's no more error in make install-config (i had one before) i am using the minimal.cfg to test nagios. I started nagios without any options. /usr/local/nagios/bin/nagios /usr/local/nagios/etc/nagios.cfg and it says: sh: @libexecdir@/check_ping: not found sh: @libexecdir@/check_users: not found sh: @libexecdir@/check_procs: not found Is this ok? The IE page now has an error... It appears as though you do not have permission to view information for any of the hosts you requested... If you believe this is an error, check the HTTP server authentication requirements for accessing this CGI and check the authorization options in your CGI configuration file. I didn't set up any authentication for nagios. Maybe it caused the error above? if it does, how to use nagios without authentication? thanks! On 8/31/05, Subhendu Ghosh wrote: > > > Yes - autoconf which creates configure uses perl :) > > You can either compile your own or sunfreeware.comhas some nice pkgadd > versions. > > -sg > > > On Wed, 31 Aug 2005, Elizar M. Palad wrote: > > > Hi, thanks! > > from ./configure's output: > > > > creating html/index.html > > creating html/side.html > > creating include/config.h > > creating include/snprintf.h > > creating include/nagios.h > > creating include/cgiutils.h > > ./configure: perl: not found > > ./configure: perl: not found > > > > Creating sample config files in sample-config/ ... > > ./configure: perl: not found > > ./configure: perl: not found > > ./configure: perl: not found > > ./configure: perl: not found > > ./configure: perl: not found > > ./configure: perl: not found > > ./configure: perl: not found > > ./configure: perl: not found > > > > > > *** Configuration summary for nagios 2.0b4 08-02-2005 ***: > > > > General Options: > > ------------------------- > > Nagios executable: nagios > > Nagios user/group: nagios,nagios > > Command user/group: nagios,nagios > > Embedded Perl: no > > Event Broker: yes > > Install ${prefix}: /usr/local/nagios > > Lock file: ${prefix}/var/nagios.lock > > Init directory: /etc/init.d > > Host OS: solaris2.6 > > .... > > Im going to install now perl and see if it will create the location.hfile. > > Thanks! > > eli > > > > On 8/31/05, Subhendu Ghosh wrote: > >> > >> On Wed, 31 Aug 2005, Elizar M. Palad wrote: > >> > >>> I rerun ./configure --prefix=/usr/local/nagios > >> --with-cgiurl=/nagios/cgi-bin > >>> --with-htmlurl=/nagios --with-nagios-user=nagios > >> --with-nagios-groupp=nagios > >>> --with-gd-lib=/usr/local/lib > >>> and summary is below: > >>> *** Configuration summary for nagios 2.0b4 08-02-2005 ***: > >>> > >>> General Options: > >>> ------------------------- > >>> Nagios executable: nagios > >>> Nagios user/group: nagios,nagios > >>> Command user/group: nagios,nagios > >>> Embedded Perl: no > >>> Event Broker: yes > >>> Install ${prefix}: /usr/local/nagios > >>> Lock file: ${prefix}/var/nagios.lock > >>> Init directory: /etc/init.d > >>> Host OS: solaris2.6 > >>> > >>> Web Interface Options: > >>> ------------------------ > >>> HTML URL: http://localhost/nagios/ > >>> CGI URL: http://localhost/nagios/cgi-bin/ > >>> Traceroute (used by WAP): > >>> > >>> > >>> Review the options above for accuracy. If they look okay, > >>> type 'make all' to compile the main program and CGIs. > >>> ------------------ > >>> Looks ok isn't it? > >>> then when i do 'make all', i have this error: > >>> # make all > >>> cd ./base && make > >>> make: Fatal error: Don't know how to make target > >> `../include/locations.h' > >>> Current working directory /packages/nagios-2.0b4/base > >>> *** Error code 1 > >>> make: Fatal error: Command failed for target `all' > >>> This is also my second time with this error. What I did was i copy > >>> locations.h.in < > http://locations.h.in> as > >> locations.h :-) > >>> and the compilation continues.. (was that not right? :) > >>> thanks! > >> > >> > >> NOOOOO - configure must have produced an error before the summary was > >> printed. > >> > >> configure should create a locations.h from locations.h.in > after > >> appropriate substitutions. > >> > >> If the file is not being created properly - you may have a file > permission > >> issue. > >> > >> run "chown -R user:group nagios_src_dir/" > >> > >> replace the user and group with the user and group info fort he person > >> running configure > >> > >> replace nagios_src_dir with the dir that contains the nagios src. > >> > >> Never copy a *.*.in file into a *.* file. > >> > >> > >> > >> -- > >> > >> -sg > >> > > > > > > > > > > -- > > -- ---- Don't Tell Me How Hard You Work.. Show Me How Much You'd Accomplished.. -------------- next part -------------- An HTML attachment was scrubbed... URL: From sghosh at sghosh.org Thu Sep 1 07:32:10 2005 From: sghosh at sghosh.org (Subhendu Ghosh) Date: Thu, 1 Sep 2005 01:32:10 -0400 (EDT) Subject: @sysconfdir@/cgi.cfg - Installation setup problem In-Reply-To: References: Message-ID: On Thu, 1 Sep 2005, Elizar M. Palad wrote: > Hi! Ok, i rerun configure script after installed perl, seems ok. no error > and all > the .h files are created and there's no more error in make install-config (i > had one before) > i am using the minimal.cfg to test nagios. > I started nagios without any options. > /usr/local/nagios/bin/nagios /usr/local/nagios/etc/nagios.cfg > and it says: > sh: @libexecdir@/check_ping: not found > sh: @libexecdir@/check_users: not found > sh: @libexecdir@/check_procs: not found > Is this ok? No - @libexecdir@ should have been fixed by configure. For nagios - there should be no errors in the configure process... > The IE page now has an error... > It appears as though you do not have permission to view information for > any of the hosts you requested... > > If you believe this is an error, check the HTTP server authentication > requirements for accessing this CGI > and check the authorization options in your CGI configuration file. > I didn't set up any authentication for nagios. Maybe it caused the error > above? > if it does, how to use nagios without authentication? > thanks! > > > On 8/31/05, Subhendu Ghosh wrote: >> >> >> Yes - autoconf which creates configure uses perl :) >> >> You can either compile your own or sunfreeware.comhas some nice pkgadd >> versions. >> >> -sg >> >> >> On Wed, 31 Aug 2005, Elizar M. Palad wrote: >> >>> Hi, thanks! >>> from ./configure's output: >>> >>> creating html/index.html >>> creating html/side.html >>> creating include/config.h >>> creating include/snprintf.h >>> creating include/nagios.h >>> creating include/cgiutils.h >>> ./configure: perl: not found >>> ./configure: perl: not found >>> >>> Creating sample config files in sample-config/ ... >>> ./configure: perl: not found >>> ./configure: perl: not found >>> ./configure: perl: not found >>> ./configure: perl: not found >>> ./configure: perl: not found >>> ./configure: perl: not found >>> ./configure: perl: not found >>> ./configure: perl: not found >>> >>> >>> *** Configuration summary for nagios 2.0b4 08-02-2005 ***: >>> >>> General Options: >>> ------------------------- >>> Nagios executable: nagios >>> Nagios user/group: nagios,nagios >>> Command user/group: nagios,nagios >>> Embedded Perl: no >>> Event Broker: yes >>> Install ${prefix}: /usr/local/nagios >>> Lock file: ${prefix}/var/nagios.lock >>> Init directory: /etc/init.d >>> Host OS: solaris2.6 >>> .... >>> Im going to install now perl and see if it will create the location.hfile. >>> Thanks! >>> eli >>> >>> On 8/31/05, Subhendu Ghosh wrote: >>>> >>>> On Wed, 31 Aug 2005, Elizar M. Palad wrote: >>>> >>>>> I rerun ./configure --prefix=/usr/local/nagios >>>> --with-cgiurl=/nagios/cgi-bin >>>>> --with-htmlurl=/nagios --with-nagios-user=nagios >>>> --with-nagios-groupp=nagios >>>>> --with-gd-lib=/usr/local/lib >>>>> and summary is below: >>>>> *** Configuration summary for nagios 2.0b4 08-02-2005 ***: >>>>> >>>>> General Options: >>>>> ------------------------- >>>>> Nagios executable: nagios >>>>> Nagios user/group: nagios,nagios >>>>> Command user/group: nagios,nagios >>>>> Embedded Perl: no >>>>> Event Broker: yes >>>>> Install ${prefix}: /usr/local/nagios >>>>> Lock file: ${prefix}/var/nagios.lock >>>>> Init directory: /etc/init.d >>>>> Host OS: solaris2.6 >>>>> >>>>> Web Interface Options: >>>>> ------------------------ >>>>> HTML URL: http://localhost/nagios/ >>>>> CGI URL: http://localhost/nagios/cgi-bin/ >>>>> Traceroute (used by WAP): >>>>> >>>>> >>>>> Review the options above for accuracy. If they look okay, >>>>> type 'make all' to compile the main program and CGIs. >>>>> ------------------ >>>>> Looks ok isn't it? >>>>> then when i do 'make all', i have this error: >>>>> # make all >>>>> cd ./base && make >>>>> make: Fatal error: Don't know how to make target >>>> `../include/locations.h' >>>>> Current working directory /packages/nagios-2.0b4/base >>>>> *** Error code 1 >>>>> make: Fatal error: Command failed for target `all' >>>>> This is also my second time with this error. What I did was i copy >>>>> locations.h.in < >> http://locations.h.in> as >>>> locations.h :-) >>>>> and the compilation continues.. (was that not right? :) >>>>> thanks! >>>> >>>> >>>> NOOOOO - configure must have produced an error before the summary was >>>> printed. >>>> >>>> configure should create a locations.h from locations.h.in >> after >>>> appropriate substitutions. >>>> >>>> If the file is not being created properly - you may have a file >> permission >>>> issue. >>>> >>>> run "chown -R user:group nagios_src_dir/" >>>> >>>> replace the user and group with the user and group info fort he person >>>> running configure >>>> >>>> replace nagios_src_dir with the dir that contains the nagios src. >>>> >>>> Never copy a *.*.in file into a *.* file. >>>> >>>> >>>> >>>> -- >>>> >>>> -sg >>>> >>> >>> >>> >>> >> >> -- >> >> > > > -- ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mailings at good-it.com Thu Sep 1 07:50:29 2005 From: mailings at good-it.com (Johan Barelds) Date: Thu, 1 Sep 2005 07:50:29 +0200 Subject: Downtime In-Reply-To: References: Message-ID: <200509010750.30042.mailings@good-it.com> I'd love to! ..:-) Grz. Johan Op woensdag 31 augustus 2005 22:41, schreef Lori Adams: > Can you report that to the list? That way everyone else knows. > > Thanks :) > > > -----Original Message----- > > From: Johan Barelds [mailto:mailings at good-it.com] > > Sent: Wednesday, August 31, 2005 1:20 PM > > To: Lori Adams > > Subject: Re: [Nagios-users] Downtime > > > > Op woensdag 31 augustus 2005 22:16, schreef u: > > > I think there has been some confusion. I just tested this out. If > > you > > > > put the HOST into a scheduled downtime, that includes all services > > for > > > > that host. If you put just a SERVICE into a downtime, then the > > downtime > > > > is only for that service. > > > > > > The checkbox is for rescheduling checks, not downtimes. > > > > > > -Lori > > > > Hi Lori, > > > > Thanks for testing out "some confusion" ..:-) > > Your answer explains what i was looking for. > > Great stuff! > > > > Grz. Johan > > > > > > -----Original Message----- > > > > From: nagios-users-admin at lists.sourceforge.net > > [mailto:nagios-users- > > > > > admin at lists.sourceforge.net] On Behalf Of Johan Barelds > > > > Sent: Wednesday, August 31, 2005 12:19 PM > > > > To: nagios-users at lists.sourceforge.net > > > > Subject: Re: [Nagios-users] Downtime > > > > > > > > Op woensdag 31 augustus 2005 19:42, schreef Andreas Ericsson: > > > > > > I am probably overlooking, but where can i find that box?? > > > > > > I can't find it in the Nagios gui. Is it in the configuration? > > > > > > > > > > My bad. It's "Schedule downtime for all services on this host" > > (in > > > > the > > > > > > > > command menu on the right hand side in the status-view for the > > host) > > > > and > > > > > > > > then check the box "Schedule downtime for host too", rather than > > the > > > > > > other way around. > > > > > > > > Mmmm.....which version of Nagios are we talking about? > > > > I am running version 2.04b and the only thing which comes close is > > the > > > > > following: > > > > > > > > 1. Goto "Host Detail" > > > > 2. select a host by clicking on its name > > > > 3. on the right-hand site i see a box called "Host Commands" > > > > 4. The only option about downtime i see here is "Schedule downtime > > for > > > > > this > > > > host". > > > > > > > > But....there is no box saying "Schedule downtime for host too" or > > > > "Schedule > > > > downtime for all services on this host"...:-( > > > > > > > > > > > > -- > > > > Kind Regards / Met vriendelijke groet, > > > > > > > > Johan Barelds Good-IT! > > > > Tel.+31(0)70-3965230 Strijplaan 320 > > > > Mob.+31(0)6-54253750 2285 HZ Rijswijk(ZH) > > > > j.barelds at good-it.com http://www.good-it.com > > > > > > > > > > > > ------------------------------------------------------- > > > > SF.Net email is Sponsored by the Better Software Conference & EXPO > > > > September 19-22, 2005 * San Francisco, CA * Development Lifecycle > > > > Practices > > > > Agile & Plan-Driven Development * Managing Projects & Teams * > > Testing > > > > & QA > > > > > > > Security * Process Improvement & Measurement * > > > > > > http://www.sqe.com/bsce5sf > > > > > > > _______________________________________________ > > > > Nagios-users mailing list > > > > Nagios-users at lists.sourceforge.net > > > > https://lists.sourceforge.net/lists/listinfo/nagios-users > > > > > > > > ::: Please include Nagios version, plugin version (-v) and OS when > > > > > > > > reporting any issue. > > > > > > > > ::: Messages without supporting info will risk being sent to > > /dev/null > > > -- > > Kind Regards / Met vriendelijke groet, > > > > Johan Barelds Good-IT! > > Tel.+31(0)70-3965230 Strijplaan 320 > > Mob.+31(0)6-54253750 2285 HZ Rijswijk(ZH) > > j.barelds at good-it.com http://www.good-it.com -- Kind Regards / Met vriendelijke groet, Johan Barelds Good-IT! Tel.+31(0)70-3965230 Strijplaan 320 Mob.+31(0)6-54253750 2285 HZ Rijswijk(ZH) j.barelds at good-it.com http://www.good-it.com ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From elizar.palad at gmail.com Thu Sep 1 08:45:55 2005 From: elizar.palad at gmail.com (Elizar M. Palad) Date: Thu, 1 Sep 2005 14:45:55 +0800 Subject: @sysconfdir@/cgi.cfg - Installation setup problem In-Reply-To: References: Message-ID: Hi Subhendu, I rerun the ./configure... same options and i couldn't find any critical errors. I attached the output produced. The only thing that looks like an error is the gd, which, according to the documentation, nagios will run without it. # ./configure --prefix=/usr/local/nagios --with-cgiurl=/nagios/cgi-bin --with-htmlurl=/nagios --with-nagios-user=na gios --with-nagios-groupp=nagios --with-gd-lib=/usr/local/lib checking for a BSD compatible install... ./install-sh -c checking host system type... sparc-sun-solaris2.6 checking for gcc... gcc checking whether the C compiler (gcc ) works... yes checking whether the C compiler (gcc ) is a cross-compiler... no checking whether we are using GNU C... yes checking whether gcc accepts -g... yes checking whether make sets ${MAKE}... yes checking for strip... /usr/ccs/bin/strip checking how to run the C preprocessor... gcc -E checking for ANSI C header files... yes checking whether time.h and sys/time.h may both be included... yes checking for sys/wait.h that is POSIX.1 compatible... yes checking for arpa/inet.h... yes checking for ctype.h... yes checking for dirent.h... yes checking for errno.h... yes checking for fcntl.h... yes checking for getopt.h... no checking for grp.h... yes checking for limits.h... yes checking for math.h... yes checking for netdb.h... yes checking for netinet/in.h... yes checking for pthread.h... yes checking for pthreads.h... no checking for pwd.h... yes checking for regex.h... yes checking for signal.h... yes checking for socket.h... no checking for string.h... yes checking for strings.h... yes checking for sys/mman.h... yes checking for sys/types.h... yes checking for sys/time.h... yes checking for sys/resource.h... yes checking for sys/wait.h... (cached) yes checking for sys/socket.h... yes checking for sys/stat.h... yes checking for sys/timeb.h... yes checking for sys/un.h... yes checking for sys/ipc.h... yes checking for sys/msg.h... yes checking for sys/poll.h... yes checking for syslog.h... yes checking for uio.h... no checking for unistd.h... yes checking for working const... yes checking whether struct tm is in sys/time.h or time.h... time.h checking for tm_zone in struct tm... no checking for tzname... yes checking for mode_t... yes checking for pid_t... yes checking for size_t... yes checking return type of signal handlers... void checking for uid_t in sys/types.h... yes checking type of array argument to getgroups... gid_t checking for initgroups... yes checking for setenv... no checking for strdup... yes checking for strstr... yes checking for strtoul... yes checking for unsetenv... no checking for type of socket size... size_t checking for pthread_create in -lcma... no checking for pthread_create in -lpthread... yes checking for library containing nanosleep... -lposix4 checking for mail... /usr/bin/mail Init script directory: /etc/init.d We'll use default routines (in xdata/xsddefault.*) for status data I/O... We'll use default routines (in xdata/xcddefault.*) for comment data I/O... We'll use default routines (in xdata/xrddefault.*) for retention data I/O... We'll use template-based routines (in xdata/xodtemplate.*) for object data I/O... We'll use default routines (in xdata/xpddefault.*) for performance data I/O... We'll use default routines (in xdata/xdddefault.*) for scheduled downtime data I/O... checking for main in -liconv... yes checking for gdImagePng in -lgd (order 1)... no checking for gdImagePng in -lgd (order 2)... no checking for gdImagePng in -lgd (order 3)... no checking for gdImagePng in -lgd (order 4)... no *** GD, PNG, and/or JPEG libraries could not be located... ********* Boutell's GD library is required to compile the statusmap, trends and histogram CGIs. Get it from http://www.boutell.com/gd/, compile it, and use the --with-gd-lib and --with-gd-inc arguments to specify the locations of the GD library and include files. NOTE: In addition to the gd-devel library, you'll also need to make sure you have the png-devel and jpeg-devel libraries installed on your system. NOTE: After you install the necessary libraries on your system: 1. Make sure /etc/ld.so.conf has an entry for the directory in which the GD, PNG, and JPEG libraries are installed. 2. Run 'ldconfig' to update the run-time linker options. 3. Run 'make clean' in the Nagios distribution to clean out any old references to your previous compile. 4. Rerun the configure script. NOTE: If you can't get the configure script to recognize the GD libs on your system, get over it and move on to other things. The CGIs that use the GD libs are just a small part of the entire Nagios package. Get everything else working first and then revisit the problem. Make sure to check the nagios-users mailing list archives for possible solutions to GD library problems when you resume your troubleshooting. ******************************************************************** checking for ltdl.h... no checking for dlfcn.h... yes checking for dlopen in -ldl... yes checking for extra flags needed to export symbols... none checking for linker flags for loadable modules... -G checking for linker flags for loadable modules... -G checking for traceroute... no checking for snprintf... yes checking for type va_list... yes checking for perl... /usr/local/bin/perl creating ./config.status creating Makefile creating subst creating pkginfo creating base/Makefile creating common/Makefile creating contrib/Makefile creating cgi/Makefile creating html/Makefile creating module/Makefile creating include/Makefile creating xdata/Makefile creating daemon-init creating html/index.html creating html/side.html creating include/config.h include/config.h is unchanged creating include/snprintf.h include/snprintf.h is unchanged creating include/nagios.h creating include/cgiutils.h include/cgiutils.h is unchanged Creating sample config files in sample-config/ ... *** Configuration summary for nagios 2.0b4 08-02-2005 ***: General Options: ------------------------- Nagios executable: nagios Nagios user/group: nagios,nagios Command user/group: nagios,nagios Embedded Perl: no Event Broker: yes Install ${prefix}: /usr/local/nagios Lock file: ${prefix}/var/nagios.lock Init directory: /etc/init.d Host OS: solaris2.6 Web Interface Options: ------------------------ HTML URL: http://localhost/nagios/ CGI URL: http://localhost/nagios/cgi-bin/ Traceroute (used by WAP): Review the options above for accuracy. If they look okay, type 'make all' to compile the main program and CGIs. On 9/1/05, Subhendu Ghosh wrote: > > On Thu, 1 Sep 2005, Elizar M. Palad wrote: > > > Hi! Ok, i rerun configure script after installed perl, seems ok. no > error > > and all > > the .h files are created and there's no more error in make > install-config (i > > had one before) > > i am using the minimal.cfg to test nagios. > > I started nagios without any options. > > /usr/local/nagios/bin/nagios /usr/local/nagios/etc/nagios.cfg > > and it says: > > sh: @libexecdir@/check_ping: not found > > sh: @libexecdir@/check_users: not found > > sh: @libexecdir@/check_procs: not found > > Is this ok? > > No - @libexecdir@ should have been fixed by configure. > > For nagios - there should be no errors in the configure process... > > > > The IE page now has an error... > > It appears as though you do not have permission to view information for > > any of the hosts you requested... > > > > If you believe this is an error, check the HTTP server authentication > > requirements for accessing this CGI > > and check the authorization options in your CGI configuration file. > > I didn't set up any authentication for nagios. Maybe it caused the error > > above? > > if it does, how to use nagios without authentication? > > thanks! > > > > > > On 8/31/05, Subhendu Ghosh wrote: > >> > >> > >> Yes - autoconf which creates configure uses perl :) > >> > >> You can either compile your own or sunfreeware.com > has some nice pkgadd > >> versions. > >> > >> -sg > >> > >> > >> On Wed, 31 Aug 2005, Elizar M. Palad wrote: > >> > >>> Hi, thanks! > >>> from ./configure's output: > >>> > >>> creating html/index.html > >>> creating html/side.html > >>> creating include/config.h > >>> creating include/snprintf.h > >>> creating include/nagios.h > >>> creating include/cgiutils.h > >>> ./configure: perl: not found > >>> ./configure: perl: not found > >>> > >>> Creating sample config files in sample-config/ ... > >>> ./configure: perl: not found > >>> ./configure: perl: not found > >>> ./configure: perl: not found > >>> ./configure: perl: not found > >>> ./configure: perl: not found > >>> ./configure: perl: not found > >>> ./configure: perl: not found > >>> ./configure: perl: not found > >>> > >>> > >>> *** Configuration summary for nagios 2.0b4 08-02-2005 ***: > >>> > >>> General Options: > >>> ------------------------- > >>> Nagios executable: nagios > >>> Nagios user/group: nagios,nagios > >>> Command user/group: nagios,nagios > >>> Embedded Perl: no > >>> Event Broker: yes > >>> Install ${prefix}: /usr/local/nagios > >>> Lock file: ${prefix}/var/nagios.lock > >>> Init directory: /etc/init.d > >>> Host OS: solaris2.6 > >>> .... > >>> Im going to install now perl and see if it will create the > location.hfile. > >>> Thanks! > >>> eli > >>> > >>> On 8/31/05, Subhendu Ghosh wrote: > >>>> > >>>> On Wed, 31 Aug 2005, Elizar M. Palad wrote: > >>>> > >>>>> I rerun ./configure --prefix=/usr/local/nagios > >>>> --with-cgiurl=/nagios/cgi-bin > >>>>> --with-htmlurl=/nagios --with-nagios-user=nagios > >>>> --with-nagios-groupp=nagios > >>>>> --with-gd-lib=/usr/local/lib > >>>>> and summary is below: > >>>>> *** Configuration summary for nagios 2.0b4 08-02-2005 ***: > >>>>> > >>>>> General Options: > >>>>> ------------------------- > >>>>> Nagios executable: nagios > >>>>> Nagios user/group: nagios,nagios > >>>>> Command user/group: nagios,nagios > >>>>> Embedded Perl: no > >>>>> Event Broker: yes > >>>>> Install ${prefix}: /usr/local/nagios > >>>>> Lock file: ${prefix}/var/nagios.lock > >>>>> Init directory: /etc/init.d > >>>>> Host OS: solaris2.6 > >>>>> > >>>>> Web Interface Options: > >>>>> ------------------------ > >>>>> HTML URL: http://localhost/nagios/ > >>>>> CGI URL: http://localhost/nagios/cgi-bin/ > >>>>> Traceroute (used by WAP): > >>>>> > >>>>> > >>>>> Review the options above for accuracy. If they look okay, > >>>>> type 'make all' to compile the main program and CGIs. > >>>>> ------------------ > >>>>> Looks ok isn't it? > >>>>> then when i do 'make all', i have this error: > >>>>> # make all > >>>>> cd ./base && make > >>>>> make: Fatal error: Don't know how to make target > >>>> `../include/locations.h' > >>>>> Current working directory /packages/nagios-2.0b4/base > >>>>> *** Error code 1 > >>>>> make: Fatal error: Command failed for target `all' > >>>>> This is also my second time with this error. What I did was i copy > >>>>> locations.h.in < > http://locations.h.in> < > >> http://locations.h.in> as > >>>> locations.h :-) > >>>>> and the compilation continues.. (was that not right? :) > >>>>> thanks! > >>>> > >>>> > >>>> NOOOOO - configure must have produced an error before the summary was > >>>> printed. > >>>> > >>>> configure should create a locations.h from locations.h.in > > >> after > >>>> appropriate substitutions. > >>>> > >>>> If the file is not being created properly - you may have a file > >> permission > >>>> issue. > >>>> > >>>> run "chown -R user:group nagios_src_dir/" > >>>> > >>>> replace the user and group with the user and group info fort he > person > >>>> running configure > >>>> > >>>> replace nagios_src_dir with the dir that contains the nagios src. > >>>> > >>>> Never copy a *.*.in file into a *.* file. > >>>> > >>>> > >>>> > >>>> -- > >>>> > >>>> -sg > >>>> > >>> > >>> > >>> > >>> > >> > >> -- > >> > >> > > > > > > > > -- > > > > ------------------------------------------------------- > SF.Net email is Sponsored by the Better Software Conference & EXPO > September 19-22, 2005 * San Francisco, CA * Development Lifecycle > Practices > Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA > Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -- ---- Don't Tell Me How Hard You Work.. Show Me How Much You'd Accomplished.. -------------- next part -------------- An HTML attachment was scrubbed... URL: From sghosh at sghosh.org Thu Sep 1 09:22:49 2005 From: sghosh at sghosh.org (Subhendu Ghosh) Date: Thu, 1 Sep 2005 03:22:49 -0400 (EDT) Subject: @sysconfdir@/cgi.cfg - Installation setup problem In-Reply-To: References: Message-ID: configure looks good here. In /usr/local/nagios/etc - do you only have one resource.cfg file? or do you have resource.cfg.in as well? you might want to delete every thing in /usr/local/nagios/etc and run "make install-config" again to get the base sample-config -sg On Thu, 1 Sep 2005, Elizar M. Palad wrote: > Hi Subhendu, > I rerun the ./configure... same options and i couldn't find any critical > errors. > I attached the output produced. The only thing that looks like an error is > the gd, which, according to the documentation, nagios will run without it. > > # ./configure --prefix=/usr/local/nagios --with-cgiurl=/nagios/cgi-bin > --with-htmlurl=/nagios --with-nagios-user=na > gios --with-nagios-groupp=nagios --with-gd-lib=/usr/local/lib > checking for a BSD compatible install... ./install-sh -c > checking host system type... sparc-sun-solaris2.6 > checking for gcc... gcc > checking whether the C compiler (gcc ) works... yes > checking whether the C compiler (gcc ) is a cross-compiler... no > checking whether we are using GNU C... yes > checking whether gcc accepts -g... yes > checking whether make sets ${MAKE}... yes > checking for strip... /usr/ccs/bin/strip > checking how to run the C preprocessor... gcc -E > checking for ANSI C header files... yes > checking whether time.h and sys/time.h may both be included... yes > checking for sys/wait.h that is POSIX.1 compatible... yes > checking for arpa/inet.h... yes > checking for ctype.h... yes > checking for dirent.h... yes > checking for errno.h... yes > checking for fcntl.h... yes > checking for getopt.h... no > checking for grp.h... yes > checking for limits.h... yes > checking for math.h... yes > checking for netdb.h... yes > checking for netinet/in.h... yes > checking for pthread.h... yes > checking for pthreads.h... no > checking for pwd.h... yes > checking for regex.h... yes > checking for signal.h... yes > checking for socket.h... no > checking for string.h... yes > checking for strings.h... yes > checking for sys/mman.h... yes > checking for sys/types.h... yes > checking for sys/time.h... yes > checking for sys/resource.h... yes > checking for sys/wait.h... (cached) yes > checking for sys/socket.h... yes > checking for sys/stat.h... yes > checking for sys/timeb.h... yes > checking for sys/un.h... yes > checking for sys/ipc.h... yes > checking for sys/msg.h... yes > checking for sys/poll.h... yes > checking for syslog.h... yes > checking for uio.h... no > checking for unistd.h... yes > checking for working const... yes > checking whether struct tm is in sys/time.h or time.h... time.h > checking for tm_zone in struct tm... no > checking for tzname... yes > checking for mode_t... yes > checking for pid_t... yes > checking for size_t... yes > checking return type of signal handlers... void > checking for uid_t in sys/types.h... yes > checking type of array argument to getgroups... gid_t > checking for initgroups... yes > checking for setenv... no > checking for strdup... yes > checking for strstr... yes > checking for strtoul... yes > checking for unsetenv... no > checking for type of socket size... size_t > checking for pthread_create in -lcma... no > checking for pthread_create in -lpthread... yes > checking for library containing nanosleep... -lposix4 > checking for mail... /usr/bin/mail > Init script directory: /etc/init.d > We'll use default routines (in xdata/xsddefault.*) for status data I/O... > We'll use default routines (in xdata/xcddefault.*) for comment data I/O... > We'll use default routines (in xdata/xrddefault.*) for retention data I/O... > We'll use template-based routines (in xdata/xodtemplate.*) for object data > I/O... > We'll use default routines (in xdata/xpddefault.*) for performance data > I/O... > We'll use default routines (in xdata/xdddefault.*) for scheduled downtime > data I/O... > checking for main in -liconv... yes > checking for gdImagePng in -lgd (order 1)... no > checking for gdImagePng in -lgd (order 2)... no > checking for gdImagePng in -lgd (order 3)... no > checking for gdImagePng in -lgd (order 4)... no > > > *** GD, PNG, and/or JPEG libraries could not be located... ********* > > Boutell's GD library is required to compile the statusmap, trends > and histogram CGIs. Get it from http://www.boutell.com/gd/, compile > it, and use the --with-gd-lib and --with-gd-inc arguments to specify > the locations of the GD library and include files. > > NOTE: In addition to the gd-devel library, you'll also need to make > sure you have the png-devel and jpeg-devel libraries installed > on your system. > > NOTE: After you install the necessary libraries on your system: > 1. Make sure /etc/ld.so.conf has an entry for the directory in > which the GD, PNG, and JPEG libraries are installed. > 2. Run 'ldconfig' to update the run-time linker options. > 3. Run 'make clean' in the Nagios distribution to clean out > any old references to your previous compile. > 4. Rerun the configure script. > > NOTE: If you can't get the configure script to recognize the GD libs > on your system, get over it and move on to other things. The > CGIs that use the GD libs are just a small part of the entire > Nagios package. Get everything else working first and then > revisit the problem. Make sure to check the nagios-users > mailing list archives for possible solutions to GD library > problems when you resume your troubleshooting. > > ******************************************************************** > > > checking for ltdl.h... no > checking for dlfcn.h... yes > checking for dlopen in -ldl... yes > checking for extra flags needed to export symbols... none > checking for linker flags for loadable modules... -G > checking for linker flags for loadable modules... -G > checking for traceroute... no > checking for snprintf... yes > checking for type va_list... yes > checking for perl... /usr/local/bin/perl > creating ./config.status > creating Makefile > creating subst > creating pkginfo > creating base/Makefile > creating common/Makefile > creating contrib/Makefile > creating cgi/Makefile > creating html/Makefile > creating module/Makefile > creating include/Makefile > creating xdata/Makefile > creating daemon-init > creating html/index.html > creating html/side.html > creating include/config.h > include/config.h is unchanged > creating include/snprintf.h > include/snprintf.h is unchanged > creating include/nagios.h > creating include/cgiutils.h > include/cgiutils.h is unchanged > > Creating sample config files in sample-config/ ... > > > *** Configuration summary for nagios 2.0b4 08-02-2005 ***: > > General Options: > ------------------------- > Nagios executable: nagios > Nagios user/group: nagios,nagios > Command user/group: nagios,nagios > Embedded Perl: no > Event Broker: yes > Install ${prefix}: /usr/local/nagios > Lock file: ${prefix}/var/nagios.lock > Init directory: /etc/init.d > Host OS: solaris2.6 > > Web Interface Options: > ------------------------ > HTML URL: http://localhost/nagios/ > CGI URL: http://localhost/nagios/cgi-bin/ > Traceroute (used by WAP): > > > Review the options above for accuracy. If they look okay, > type 'make all' to compile the main program and CGIs. > > > On 9/1/05, Subhendu Ghosh wrote: >> >> On Thu, 1 Sep 2005, Elizar M. Palad wrote: >> >>> Hi! Ok, i rerun configure script after installed perl, seems ok. no >> error >>> and all >>> the .h files are created and there's no more error in make >> install-config (i >>> had one before) >>> i am using the minimal.cfg to test nagios. >>> I started nagios without any options. >>> /usr/local/nagios/bin/nagios /usr/local/nagios/etc/nagios.cfg >>> and it says: >>> sh: @libexecdir@/check_ping: not found >>> sh: @libexecdir@/check_users: not found >>> sh: @libexecdir@/check_procs: not found >>> Is this ok? >> >> No - @libexecdir@ should have been fixed by configure. >> >> For nagios - there should be no errors in the configure process... >> >> >>> The IE page now has an error... >>> It appears as though you do not have permission to view information for >>> any of the hosts you requested... >>> >>> If you believe this is an error, check the HTTP server authentication >>> requirements for accessing this CGI >>> and check the authorization options in your CGI configuration file. >>> I didn't set up any authentication for nagios. Maybe it caused the error >>> above? >>> if it does, how to use nagios without authentication? >>> thanks! >>> >>> >>> On 8/31/05, Subhendu Ghosh wrote: >>>> >>>> >>>> Yes - autoconf which creates configure uses perl :) >>>> >>>> You can either compile your own or sunfreeware.com >> has some nice pkgadd >>>> versions. >>>> >>>> -sg >>>> >>>> >>>> On Wed, 31 Aug 2005, Elizar M. Palad wrote: >>>> >>>>> Hi, thanks! >>>>> from ./configure's output: >>>>> >>>>> creating html/index.html >>>>> creating html/side.html >>>>> creating include/config.h >>>>> creating include/snprintf.h >>>>> creating include/nagios.h >>>>> creating include/cgiutils.h >>>>> ./configure: perl: not found >>>>> ./configure: perl: not found >>>>> >>>>> Creating sample config files in sample-config/ ... >>>>> ./configure: perl: not found >>>>> ./configure: perl: not found >>>>> ./configure: perl: not found >>>>> ./configure: perl: not found >>>>> ./configure: perl: not found >>>>> ./configure: perl: not found >>>>> ./configure: perl: not found >>>>> ./configure: perl: not found >>>>> >>>>> >>>>> *** Configuration summary for nagios 2.0b4 08-02-2005 ***: >>>>> >>>>> General Options: >>>>> ------------------------- >>>>> Nagios executable: nagios >>>>> Nagios user/group: nagios,nagios >>>>> Command user/group: nagios,nagios >>>>> Embedded Perl: no >>>>> Event Broker: yes >>>>> Install ${prefix}: /usr/local/nagios >>>>> Lock file: ${prefix}/var/nagios.lock >>>>> Init directory: /etc/init.d >>>>> Host OS: solaris2.6 >>>>> .... >>>>> Im going to install now perl and see if it will create the >> location.hfile. >>>>> Thanks! >>>>> eli >>>>> >>>>> On 8/31/05, Subhendu Ghosh wrote: >>>>>> >>>>>> On Wed, 31 Aug 2005, Elizar M. Palad wrote: >>>>>> >>>>>>> I rerun ./configure --prefix=/usr/local/nagios >>>>>> --with-cgiurl=/nagios/cgi-bin >>>>>>> --with-htmlurl=/nagios --with-nagios-user=nagios >>>>>> --with-nagios-groupp=nagios >>>>>>> --with-gd-lib=/usr/local/lib >>>>>>> and summary is below: >>>>>>> *** Configuration summary for nagios 2.0b4 08-02-2005 ***: >>>>>>> >>>>>>> General Options: >>>>>>> ------------------------- >>>>>>> Nagios executable: nagios >>>>>>> Nagios user/group: nagios,nagios >>>>>>> Command user/group: nagios,nagios >>>>>>> Embedded Perl: no >>>>>>> Event Broker: yes >>>>>>> Install ${prefix}: /usr/local/nagios >>>>>>> Lock file: ${prefix}/var/nagios.lock >>>>>>> Init directory: /etc/init.d >>>>>>> Host OS: solaris2.6 >>>>>>> >>>>>>> Web Interface Options: >>>>>>> ------------------------ >>>>>>> HTML URL: http://localhost/nagios/ >>>>>>> CGI URL: http://localhost/nagios/cgi-bin/ >>>>>>> Traceroute (used by WAP): >>>>>>> >>>>>>> >>>>>>> Review the options above for accuracy. If they look okay, >>>>>>> type 'make all' to compile the main program and CGIs. >>>>>>> ------------------ >>>>>>> Looks ok isn't it? >>>>>>> then when i do 'make all', i have this error: >>>>>>> # make all >>>>>>> cd ./base && make >>>>>>> make: Fatal error: Don't know how to make target >>>>>> `../include/locations.h' >>>>>>> Current working directory /packages/nagios-2.0b4/base >>>>>>> *** Error code 1 >>>>>>> make: Fatal error: Command failed for target `all' >>>>>>> This is also my second time with this error. What I did was i copy >>>>>>> locations.h.in < >> http://locations.h.in> < >>>> http://locations.h.in> as >>>>>> locations.h :-) >>>>>>> and the compilation continues.. (was that not right? :) >>>>>>> thanks! >>>>>> >>>>>> >>>>>> NOOOOO - configure must have produced an error before the summary was >>>>>> printed. >>>>>> >>>>>> configure should create a locations.h from locations.h.in >> >>>> after >>>>>> appropriate substitutions. >>>>>> >>>>>> If the file is not being created properly - you may have a file >>>> permission >>>>>> issue. >>>>>> >>>>>> run "chown -R user:group nagios_src_dir/" >>>>>> >>>>>> replace the user and group with the user and group info fort he >> person >>>>>> running configure >>>>>> >>>>>> replace nagios_src_dir with the dir that contains the nagios src. >>>>>> >>>>>> Never copy a *.*.in file into a *.* file. >>>>>> >>>>>> >>>>>> >>>>>> -- >>>>>> >>>>>> -sg >>>>>> >>>>> >>>>> >>>>> >>>>> >>>> >>>> -- >>>> >>>> >>> >>> >>> >> >> -- >> >> >> >> ------------------------------------------------------- >> SF.Net email is Sponsored by the Better Software Conference & EXPO >> September 19-22, 2005 * San Francisco, CA * Development Lifecycle >> Practices >> Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA >> Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf >> _______________________________________________ >> Nagios-users mailing list >> Nagios-users at lists.sourceforge.net >> https://lists.sourceforge.net/lists/listinfo/nagios-users >> ::: Please include Nagios version, plugin version (-v) and OS when >> reporting any issue. >> ::: Messages without supporting info will risk being sent to /dev/null >> > > > > -- ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From m.borsani at it.net Thu Sep 1 10:42:28 2005 From: m.borsani at it.net (Marco Borsani) Date: Thu, 1 Sep 2005 10:42:28 +0200 Subject: Ranges for check_snmp Message-ID: Hi all ! I'd like to check some metrics on my firewall, like CPU but when I try to set the ranges for warning and critical values I receive "strange" answers. ./check_snmp -H HOSTADDRESS -o .1.3.6.1.4.1.9.9.109.1.1.1.1.3.1 -C public SNMP WARNING - 9 ./check_snmp -H HOSTADDRESS -o .1.3.6.1.4.1.9.9.109.1.1.1.1.3.1 -C public -w 15:24 -c 25:100 SNMP CRITICAL - *7* Why? What's my fault ? I need to receive a Warning over 15% and a Critical over 25%. Regards Marco Borsani Unix & Monitoring System Administrator Technical Operation Tel. +39 010 4310115 Fax +39 010 4327454 E-mail: m.borsani at IT.net ITnet S.r.l. - Direzione e Coordinamento di WIND Telecomunicazioni S.p.A. Internet Service Provider Sede legale: Via C.G.Viola, 48 - 00148 Roma Dir. Centrale e Amministrativa: Via Pacinotti, 39 16151 Genova (Italy) http://www.it.net mailto:info at IT.net _______________________________________________________________ Altre sedi ITnet: MILANO tel.: +39 02 30114900 info-milano at IT.net ROMA tel.: +39 06 83116707 info-roma at IT.net _______________________________________________________________ ITnet is associated to CIX (Commercial IP eXchange) and RIPE ITnet is associated to AIIP (Associazione Italiana Internet Providers) ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From justin.shore at sktbcs.com Thu Sep 1 17:48:40 2005 From: justin.shore at sktbcs.com (Justin Shore) Date: Thu, 1 Sep 2005 10:48:40 -0500 Subject: Ranges for check_snmp Message-ID: Here is what I'm using to check temp on a Cisco 6509. The syntax should be applicable in your scenario. check_command check_snmp!1.3.6.1.4.1.9.9.91.1.1.1.1.4.4001!COMMUNITY-STRING!27,30!30,150 My check_snmp is defined as: # 'check_snmp' command definition define command{ command_name check_snmp command_line $USER1$/check_snmp -H $HOSTADDRESS$ -o $ARG1$ -C $ARG2$ -w $ARG3$ -c $ARG4$ } This translates into: check_snmp -H $HOSTADDRESS -o 1.3.6.1.4.1.9.9.91.1.1.1.1.4.4001 -C COMMUNITY-STRING -w 27,30 -c 30,150 Justin > -----Original Message----- > From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users- > admin at lists.sourceforge.net] On Behalf Of Marco Borsani > Sent: Thursday, September 01, 2005 3:42 AM > To: NAGIOS > Subject: [Nagios-users] Ranges for check_snmp > Importance: High > > Hi all ! > > I'd like to check some metrics on my firewall, like CPU but when I try to > set the ranges for warning and critical values I receive "strange" > answers. > > ./check_snmp -H HOSTADDRESS -o .1.3.6.1.4.1.9.9.109.1.1.1.1.3.1 -C public > SNMP WARNING - 9 > > ./check_snmp -H HOSTADDRESS -o .1.3.6.1.4.1.9.9.109.1.1.1.1.3.1 -C public > -w > 15:24 -c 25:100 > SNMP CRITICAL - *7* > > Why? What's my fault ? > > I need to receive a Warning over 15% and a Critical over 25%. > > Regards > > Marco Borsani > Unix & Monitoring System Administrator > Technical Operation > Tel. +39 010 4310115 > Fax +39 010 4327454 > E-mail: m.borsani at IT.net > > ITnet S.r.l. - Direzione e Coordinamento di WIND Telecomunicazioni S.p.A. > Internet Service Provider > Sede legale: Via C.G.Viola, 48 - 00148 Roma > Dir. Centrale e Amministrativa: Via Pacinotti, 39 > 16151 Genova (Italy) > > http://www.it.net > mailto:info at IT.net > _______________________________________________________________ > Altre sedi ITnet: > MILANO tel.: +39 02 30114900 info-milano at IT.net > ROMA tel.: +39 06 83116707 info-roma at IT.net > _______________________________________________________________ > ITnet is associated to CIX (Commercial IP eXchange) and RIPE > ITnet is associated to AIIP (Associazione Italiana Internet Providers) > > > > > ------------------------------------------------------- > SF.Net email is Sponsored by the Better Software Conference & EXPO > September 19-22, 2005 * San Francisco, CA * Development Lifecycle > Practices > Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA > Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > > -- > No virus found in this incoming message. > Checked by AVG Anti-Virus. > Version: 7.0.344 / Virus Database: 267.10.17/84 - Release Date: 8/29/2005 > -- No virus found in this outgoing message. Checked by AVG Anti-Virus. Version: 7.0.344 / Virus Database: 267.10.17/84 - Release Date: 8/29/2005 ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From boinger at tradingtechnologies.com Thu Sep 1 15:53:28 2005 From: boinger at tradingtechnologies.com (jeff vier) Date: Thu, 01 Sep 2005 08:53:28 -0500 Subject: 404 Not Found In-Reply-To: <431634E2.40806@effective-it.co.uk> References: <431634E2.40806@effective-it.co.uk> Message-ID: <1125582808.31599.50.camel@chi101100.int.tt.local> On Wed, 2005-08-31 at 23:53 +0100, Lee Ball wrote: > I have to disagree here, I find it more awkward to have to scroll down > through what has already been written to view the new bits, especially so trim the junk, keep the necessary content. "[snip]" and "[chop]" are useful to denote significant removed content. > when sometimes the replies don't go in the >> format properly. that's a mail client issue, not a technique issue. If your mail client can't quote properly, use a better one. > I know to change the config of apache to /nagios/ (only after realising > the manual conflicted with itself). But if someone follows it word for > word then they might have an issue. So do your "duty" as a good Nagios mailing list citizen and post a follow-up summary. You needn't include *any* context there, in many cases. Context replying isn't as pivotal to understanding the logic on single-subject, short-reply threads *while you're engaged in the conversation*. However, when these messages are archived and a future user is searching them, proper context is vital to an efficient evaluation of the content. Only since the advent of Outlook has this lazy top-post format come into any amount of vogue. Previous to that horrid mail client, most (all?) mail clients and newsreaders defaulted to quoting (properly) and starting the reply at the bottom. Hope this clarifies things. I will say I am very glad that top-posting is considered a faux pas on this list. So many lists these days don't take a stance and it irritates me considerably. -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 189 bytes Desc: This is a digitally signed message part URL: From drew.cullis at gwl.com Thu Sep 1 16:52:29 2005 From: drew.cullis at gwl.com (Drew Cullis) Date: Thu, 01 Sep 2005 08:52:29 -0600 Subject: problems with nagios.cmd and apache 2.0.5 & above Message-ID: <431715AD.2080105@gwl.com> Greetings all; I am in the process of upgrading our development server along with Nagios and am running into a problem getting nagios to work with Apache. I am running Red Hat Enterprise Server which has Apache 2.0.52 by default. I found this post from last year but it doesn't look like there was resolution to his problem, which is the same as mine. https://sourceforge.net/mailarchive/message.php?msg_id=10405076 For your dining and dancing pleasure, here is the error that is generated when you try to run an external command. Error: Could not stat() command file '/usr/local/nagios/var/rw/nagios.cmd'! The external command file may be missing, Nagios may not be running, and/or Nagios may not be checking external commands. An error occurred while attempting to commit your command for processing Nagios does work like a charm on my workstation running Apache 2.0.46, no issues whatsoever straight out of the box. It looks like the issue is with Apache 2.0.5 and above. Has anyone run into this problem also and found a solution? Here are the pertinent details. RH ES4 Apache 2.0.52 Nagios 2.0b4 Nagios Plugins 1.4 Thanks for the help... -Drew ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mailings at good-it.com Thu Sep 1 19:07:48 2005 From: mailings at good-it.com (Johan Barelds) Date: Thu, 1 Sep 2005 19:07:48 +0200 Subject: Windows Tray utility Message-ID: <200509011907.48935.mailings@good-it.com> Hi all, For BigBrother there is a very handy utily which you can install on your windows PC in the tray. When your monitoring page turn's into warning or critical it get's the same color and a box pop's up telling you what server/services are causing problems. Can someone tell me if there is something for Nagios? I checked Nagiosexchange but couldn't find anything. Thanks for any reply! -- Kind Regards / Met vriendelijke groet, Johan Barelds Good-IT! Tel.+31(0)70-3965230 Strijplaan 320 Mob.+31(0)6-54253750 2285 HZ Rijswijk(ZH) j.barelds at good-it.com http://www.good-it.com ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From robmossrm at aol.com Thu Sep 1 19:19:12 2005 From: robmossrm at aol.com (Rob Moss) Date: Thu, 01 Sep 2005 18:19:12 +0100 Subject: statusmap.cgi can't find libgd.so.2 in /usr/local/lib In-Reply-To: <20050829184002.19448.qmail@web54712.mail.yahoo.com> References: <20050829184002.19448.qmail@web54712.mail.yahoo.com> Message-ID: <43173810.9010008@aol.com> Hi John, It appears to me that your system library paths are not looking in /usr/local/lib. There may be a symbolic link in /usr/lib for freetype.so pointing into /usr/local/lib To resolve the problem, you can add the line /usr/local/lib Into /etc/ld.so.conf and run ldconfig Which should then fix the problem. if not, then please provide the output of the following commands: ldd ls -la /usr/local/lib/libgd* cat /etc/ld.so.conf echo $LD_LIBRARY_PATH As far as I'm aware, statusmap.cgi is part of the core of nagios, and not part of the plugins, and would have been compiled with the core of nagios. Cheers rob. John Christian wrote: >Hello, > >I've RTFM, Googled, and checked the list archives but >can't seem to really fix this issue. Any assistance >would be very appreciated! > >It seems that statusmap.cgi in nagios-plugins-1.4.1 is >not finding libgd.so.2 in /usr/local/lib. If I copy >libgd.so.2 from /usr/local/lib to /usr/lib, >nagios-plugins finds it just fine and statusmap.cgi >works fine. > >How do I get nagios-plugins-1.4.1 to "see" the >libgd.so.2 library in /usr/local/lib? > >BTW: I did an ldd of all the Nagios CGI's and there >are other CGI's that DO find other libraries in >/usr/local/lib. For example, trends.cgi found >libfreetype.so.6 in /usr/local/lib just fine. Seems >strange that statusmap.cgi doesn't find what it needs >in /usr/local/lib. > >Nagios 2.0b4 >Linux foo.foo.com 2.4.21-27.0.2.ELsmp #1 SMP Wed Jan >13 22:32:42 EST 2005 i686 i686 i386 GNU/Linux >GNU Make version 3.79.1, by Richard Stallman and >Roland McGrath. Built for i386-redhat-linux-gnu > >TIA! >-John > > -- Rob Moss Unix Systems Admin Hosting & DB Operations Hammersmith, London, UK ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Thu Sep 1 19:31:56 2005 From: marc at ena.com (Marc Powell) Date: Thu, 1 Sep 2005 12:31:56 -0500 Subject: problems with nagios.cmd and apache 2.0.5 & above Message-ID: > -----Original Message----- > From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users- > admin at lists.sourceforge.net] On Behalf Of Drew Cullis > Sent: Thursday, September 01, 2005 9:52 AM > To: Nagios-users; Drew (Work) > Subject: [Nagios-users] problems with nagios.cmd and apache 2.0.5 & above > > Greetings all; > I am in the process of upgrading our development server along with > Nagios and am running into a problem getting nagios to work with > Apache. I am running Red Hat Enterprise Server > which has Apache 2.0.52 by default. I found this post from last year but > it doesn't look like there was resolution to his problem, which is the > same as mine. > > https://sourceforge.net/mailarchive/message.php?msg_id=10405076 > > For your dining and dancing pleasure, here is the error that is > generated when you try to run an external command. > > Error: Could not stat() command file > '/usr/local/nagios/var/rw/nagios.cmd'! > The external command file may be missing, Nagios may not be running, > and/or Nagios may not be checking external commands. > An error occurred while attempting to commit your command for processing > > Nagios does work like a charm on my workstation running Apache 2.0.46, > no issues whatsoever straight out of the box. It looks like the issue is > with Apache 2.0.5 and above. IMHO, it would be surprising if this were an apache issue. Have you thoroughly eliminated SELinux permission as a potential cause and verified permissions on nagios.cmd and the directories above it? Do you find references to nagios.cmd in /var/log/messages? Does the apache error_log provide any additional information? -- Marc ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Millard.Matt at principal.com Thu Sep 1 19:37:26 2005 From: Millard.Matt at principal.com (Millard, Matt) Date: Thu, 1 Sep 2005 12:37:26 -0500 Subject: Windows Tray utility Message-ID: <6201DF063335254BA0D6AA7053D101170A62A73D@pfgdsmmbx006.principalusa.corp.principal.com> Search Nagios Exchange for "tray" and you'll find 3 of them. I personally use Ntray and am happy with it. Matt -----Original Message----- For BigBrother there is a very handy utily which you can install on your windows PC in the tray. When your monitoring page turn's into warning or critical it get's the same color and a box pop's up telling you what server/services are causing problems. Can someone tell me if there is something for Nagios? I checked Nagiosexchange but couldn't find anything. -----Message Disclaimer----- This e-mail message is intended only for the use of the individual or entity to which it is addressed, and may contain information that is privileged, confidential and exempt from disclosure under applicable law. If you are not the intended recipient, any dissemination, distribution or copying of this communication is strictly prohibited. If you have received this communication in error, please notify us immediately by reply email to Connect at principal.com and delete or destroy all copies of the original message and attachments thereto. Email sent to or from the Principal Financial Group or any of its member companies may be retained as required by law or regulation. Nothing in this message is intended to constitute an Electronic signature for purposes of the Uniform Electronic Transactions Act (UETA) or the Electronic Signatures in Global and National Commerce Act ("E-Sign") unless a specific statement to the contrary is included in this message. ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From sghosh at sghosh.org Thu Sep 1 18:45:31 2005 From: sghosh at sghosh.org (Subhendu Ghosh) Date: Thu, 1 Sep 2005 12:45:31 -0400 (EDT) Subject: problems with nagios.cmd and apache 2.0.5 & above In-Reply-To: <431715AD.2080105@gwl.com> References: <431715AD.2080105@gwl.com> Message-ID: On Thu, 1 Sep 2005, Drew Cullis wrote: > Greetings all; > I am in the process of upgrading our development server along with Nagios and > am running into a problem getting nagios to work with Apache. I am running > Red Hat Enterprise Server > which has Apache 2.0.52 by default. I found this post from last year but it > doesn't look like there was resolution to his problem, which is the same as > mine. > > https://sourceforge.net/mailarchive/message.php?msg_id=10405076 > > For your dining and dancing pleasure, here is the error that is generated > when you try to run an external command. > > Error: Could not stat() command file '/usr/local/nagios/var/rw/nagios.cmd'! > The external command file may be missing, Nagios may not be running, and/or > Nagios may not be checking external commands. > An error occurred while attempting to commit your command for processing > > Nagios does work like a charm on my workstation running Apache 2.0.46, no > issues whatsoever straight out of the box. It looks like the issue is with > Apache 2.0.5 and above. > Has anyone run into this problem also and found a solution? Here are the > pertinent details. > > RH ES4 > Apache 2.0.52 > Nagios 2.0b4 > Nagios Plugins 1.4 > > Thanks for the help... > > -Drew Assuming Nagios is running and it created the named pipe... Other than Apache - do you have SELinux enabled? the external command would be running as apache user and have to write to the pipe you may have to apply http context to the file... -- -sg ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From td3201 at gmail.com Thu Sep 1 20:13:20 2005 From: td3201 at gmail.com (Terry) Date: Thu, 1 Sep 2005 13:13:20 -0500 Subject: fork errors Message-ID: <8ee0610105090111133989e4e1@mail.gmail.com> Hello, I have been having this issue for quite some time. For some unknown reason, nagios stops performing checks with these errors: [1125536952] Warning: The check of service 'PING' on host 'hostname' could not be performed due to a fork() error. The check will be rescheduled. All checks fail like this until nagios is restarted. When this problem is occuring I can run the service checks manually both as the nagios user and as the root user. There are no resource problems that I can see at the time. We do not appear to be hitting a limit with open files or anything like that either. The nagios mirrors the root user in that area. What could be wrong? Thanks! ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From williamw at xeye.com Thu Sep 1 20:14:20 2005 From: williamw at xeye.com (William Wang) Date: Thu, 1 Sep 2005 14:14:20 -0400 Subject: nagios pooling question Message-ID: Hi Gurus, Can someone tell me how Nagios does pooling when it connect to the NT or UNIX agents? Are TCP or UDP ports used? What's the port numbers? I want to user Nagios as a central monitoring server and have angents and clients installed on the NT/UNIX boxes in the local LAN and remote side connecting to local LAN by using site-to-site VPN, safe@ device. Any solution is more than welcome. Thanks a lot, William ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From drew.cullis at gwl.com Thu Sep 1 19:34:52 2005 From: drew.cullis at gwl.com (Drew Cullis) Date: Thu, 01 Sep 2005 11:34:52 -0600 Subject: problems with nagios.cmd and apache 2.0.5 & above In-Reply-To: References: <431715AD.2080105@gwl.com> Message-ID: <43173BBC.9000001@gwl.com> I ran /usr/sbin/sestatus -v and received this bit of info from the output (there was more but I included only these lines). Would disabling SELinux solve the problem? SELinux status: enabled SELinuxfs mount: /selinux Current mode: enforcing Mode from config file: enforcing Policy version: 18 Policy from config file:targeted Subhendu Ghosh wrote: > On Thu, 1 Sep 2005, Drew Cullis wrote: > >> Greetings all; >> I am in the process of upgrading our development server along with >> Nagios and am running into a problem getting nagios to work with >> Apache. I am running Red Hat Enterprise Server >> which has Apache 2.0.52 by default. I found this post from last year >> but it doesn't look like there was resolution to his problem, which >> is the same as mine. >> >> https://sourceforge.net/mailarchive/message.php?msg_id=10405076 >> >> For your dining and dancing pleasure, here is the error that is >> generated when you try to run an external command. >> >> Error: Could not stat() command file >> '/usr/local/nagios/var/rw/nagios.cmd'! >> The external command file may be missing, Nagios may not be running, >> and/or Nagios may not be checking external commands. >> An error occurred while attempting to commit your command for processing >> >> Nagios does work like a charm on my workstation running Apache >> 2.0.46, no issues whatsoever straight out of the box. It looks like >> the issue is with Apache 2.0.5 and above. >> Has anyone run into this problem also and found a solution? Here are >> the pertinent details. >> >> RH ES4 >> Apache 2.0.52 >> Nagios 2.0b4 >> Nagios Plugins 1.4 >> >> Thanks for the help... >> >> -Drew > > > Assuming Nagios is running and it created the named pipe... > > Other than Apache - do you have SELinux enabled? the external command > would be running as apache user and have to write to the pipe you may > have to apply http context to the file... > ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Thu Sep 1 19:35:51 2005 From: marc at ena.com (Marc Powell) Date: Thu, 1 Sep 2005 12:35:51 -0500 Subject: Windows Tray utility Message-ID: > -----Original Message----- > From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users- > admin at lists.sourceforge.net] On Behalf Of Johan Barelds > Sent: Thursday, September 01, 2005 12:08 PM > To: nagios-users at lists.sourceforge.net > Subject: [Nagios-users] Windows Tray utility > > Hi all, > > For BigBrother there is a very handy utily which you can install on your > windows PC in the tray. > When your monitoring page turn's into warning or critical it get's the > same > color and a box pop's up telling you what server/services are causing > problems. > > Can someone tell me if there is something for Nagios? > I checked Nagiosexchange but couldn't find anything. http://www.nagiosexchange.org/Frontends.37.0.html More specifically -- http://www.nagiosexchange.org/Frontends.37.0.html?&tx_netnagext_pi1[p_vi ew]=165 http://www.nagiosexchange.org/Frontends.37.0.html?&tx_netnagext_pi1[p_vi ew]=168&tx_netnagext_pi1[page]=10%3A10 http://www.nagiosexchange.org/Frontends.37.0.html?&tx_netnagext_pi1[p_vi ew]=233&tx_netnagext_pi1[page]=10%3A10 -- Marc ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mailings at good-it.com Thu Sep 1 20:58:23 2005 From: mailings at good-it.com (Johan Barelds) Date: Thu, 1 Sep 2005 20:58:23 +0200 Subject: Windows Tray utility In-Reply-To: <59B15593F41BD24591D59436E7226EAD0268D14B@Khiphx2.khimetrics.com> References: <59B15593F41BD24591D59436E7226EAD0268D14B@Khiphx2.khimetrics.com> Message-ID: <200509012058.23357.mailings@good-it.com> Thanks guys! I have been overseeing a lot in NagiosExchange i see...:-) Great stuff! Grz. Johan Op donderdag 1 september 2005 20:11, schreef Nathan Oyler: > http://www.nagiosexchange.org/Frontends.37.0.html?&tx_netnagext_pi1[p_vi > ew]=168 -- Kind Regards / Met vriendelijke groet, Johan Barelds Good-IT! Tel.+31(0)70-3965230 Strijplaan 320 Mob.+31(0)6-54253750 2285 HZ Rijswijk(ZH) j.barelds at good-it.com http://www.good-it.com ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From f1216 at yahoo.com Thu Sep 1 19:53:03 2005 From: f1216 at yahoo.com (Fred) Date: Thu, 1 Sep 2005 10:53:03 -0700 (PDT) Subject: Service checks pending forever in distributed monitoring configuration Message-ID: <20050901175303.4941.qmail@web31903.mail.mud.yahoo.com> I have a 1000+ node system plus a number of switches etc that are all monitored by Nagios. I'm running 2.0b3. Our configuration is generated automatically based on the clusters configuration and in smaller configurations has no issues. Recently, nagios started delaying execution of active service checks. I have 5 nagios monitors reporting via nsca to a 6th nagios master (which also monitors 1/6th of the cluster). I removed all the retention caches for all the monitor nodes and restarted. Nagios then reports that the next service check is scheduled for hours later (when it should be fairly close). Attached is output from nagiostats. There are quite a few services, most all are passive checks with each monitor node running some active checks that will push data to the FIFO where it is then picked up and reported on a per-node/service basis. The pending checks do not execute even when the time passes. The monitor nodes are working just fine, the master node which is configured to obsessing is disabled (on the master) and freshness checking is enabled. There is nothing in nagios.log other then stale check messages. Following is an example service description from a service that is not getting scheduled: define service{ use nagios host_name nh name slurmMonitor service_description Slurm Monitor active_checks_enabled 1 check_command check_slurm register 1 } and the template: # Generic template for services define service{ use generic-service ; default service name nagios normal_check_interval 5 retry_check_interval 2 check_period 24x7 is_volatile 0 max_check_attempts 3 notification_interval 240 notification_period 24x7 notification_options w,u,c,r contact_groups admins register 0 } and finally, the generic-service template: # Generic service definition template define service{ name generic-service ; The 'name' of this service template, referenced in other service definitions active_checks_enabled 1 ; Active service checks are enabled passive_checks_enabled 1 ; Passive service checks are enabled/accepted parallelize_check 1 ; Active service checks should be parallelized (disabling this can lead to major performance problems) obsess_over_service 1 ; We should obsess over this service (if necessary) check_freshness 0 ; Default is to NOT check service 'freshness' notifications_enabled 1 ; Service notifications are enabled event_handler_enabled 1 ; Service event handler is enabled flap_detection_enabled 1 ; Flap detection is enabled process_perf_data 1 ; Process performance data retain_status_information 1 ; Retain status information across program restarts retain_nonstatus_information 1 ; Retain non-status information across program restarts register 0 ; DONT REGISTER THIS DEFINITION - ITS NOT A REAL SERVICE, JUST A TEMPLATE! } Clocks are correct and synchronized on the system. Nagios Stats 2.0b3 Copyright (c) 2003-2005 Ethan Galstad (www.nagios.org) Last Modified: 04-03-2005 License: GPL CURRENT STATUS DATA ---------------------------------------------------- Status File: /opt/hptc/nagios/var/status.log Status File Age: 0d 0h 0m 1s Status File Version: 2.0b3 Program Running Time: 0d 48h 0m 56s Total Services: 10388 Services Checked: 8472 Services Scheduled: 246 Active Service Checks: 4774 Passive Service Checks: 5614 Total Service State Change: 0.000 / 63.550 / 2.210 % Active Service Latency: 0.000 / 2714.925 / 1220.973 % Active Service Execution Time: 0.000 / 180.065 / 0.119 sec Active Service State Change: 0.000 / 17.830 / 1.222 % Active Services Last 1/5/15/60 min: 0 / 0 / 0 / 4 Passive Service State Change: 0.000 / 63.550 / 3.050 % Passive Services Last 1/5/15/60 min: 0 / 440 / 2566 / 4724 Services Ok/Warn/Unk/Crit: 7420 / 2866 / 0 / 102 Services Flapping: 0 Services In Downtime: 0 Total Hosts: 1094 Hosts Checked: 1030 Hosts Scheduled: 0 Active Host Checks: 1094 Passive Host Checks: 0 Total Host State Change: 0.000 / 0.000 / 0.000 % Active Host Latency: 0.000 / 0.000 / 0.000 % Active Host Execution Time: 0.000 / 0.000 / 0.000 sec Active Host State Change: 0.000 / 0.000 / 0.000 % Active Hosts Last 1/5/15/60 min: 0 / 0 / 0 / 0 Passive Host State Change: 0.000 / 0.000 / 0.000 % Passive Hosts Last 1/5/15/60 min: 0 / 0 / 0 / 0 Hosts Up/Down/Unreach: 1094 / 0 / 0 Hosts Flapping: 0 Hosts In Downtime: 0 Anyone have any suggestions as to what to look for next? If I force the scheduling of the service, it eventually gets scheduled and runs, it does update the pending time in the web display right away. Thanks in advance for any insight. -FredC ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From drew.cullis at gwl.com Thu Sep 1 19:38:47 2005 From: drew.cullis at gwl.com (Drew Cullis) Date: Thu, 01 Sep 2005 11:38:47 -0600 Subject: problems with nagios.cmd and apache 2.0.5 & above In-Reply-To: References: Message-ID: <43173CA7.3030306@gwl.com> Now that I'm a little more familiar with SELinux, I will look into it more thoroughly instead of Apache. I did verify the permissions on the directories before I sent my original post. I will double check them just to make sure. Thanks... Marc Powell wrote: > > >>-----Original Message----- >>From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users- >>admin at lists.sourceforge.net] On Behalf Of Drew Cullis >>Sent: Thursday, September 01, 2005 9:52 AM >>To: Nagios-users; Drew (Work) >>Subject: [Nagios-users] problems with nagios.cmd and apache 2.0.5 & >> >> >above > > >>Greetings all; >>I am in the process of upgrading our development server along with >>Nagios and am running into a problem getting nagios to work with >>Apache. I am running Red Hat Enterprise Server >>which has Apache 2.0.52 by default. I found this post from last year >> >> >but > > >>it doesn't look like there was resolution to his problem, which is the >>same as mine. >> >>https://sourceforge.net/mailarchive/message.php?msg_id=10405076 >> >>For your dining and dancing pleasure, here is the error that is >>generated when you try to run an external command. >> >>Error: Could not stat() command file >>'/usr/local/nagios/var/rw/nagios.cmd'! >>The external command file may be missing, Nagios may not be running, >>and/or Nagios may not be checking external commands. >>An error occurred while attempting to commit your command for >> >> >processing > > >>Nagios does work like a charm on my workstation running Apache 2.0.46, >>no issues whatsoever straight out of the box. It looks like the issue >> >> >is > > >>with Apache 2.0.5 and above. >> >> > >IMHO, it would be surprising if this were an apache issue. Have you >thoroughly eliminated SELinux permission as a potential cause and >verified permissions on nagios.cmd and the directories above it? Do you >find references to nagios.cmd in /var/log/messages? Does the apache >error_log provide any additional information? > >-- >Marc > > >------------------------------------------------------- >SF.Net email is Sponsored by the Better Software Conference & EXPO >September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices >Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA >Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf >_______________________________________________ >Nagios-users mailing list >Nagios-users at lists.sourceforge.net >https://lists.sourceforge.net/lists/listinfo/nagios-users >::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. >::: Messages without supporting info will risk being sent to /dev/null > > ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From sghosh at sghosh.org Thu Sep 1 19:59:20 2005 From: sghosh at sghosh.org (Subhendu Ghosh) Date: Thu, 1 Sep 2005 13:59:20 -0400 (EDT) Subject: problems with nagios.cmd and apache 2.0.5 & above In-Reply-To: <43173BBC.9000001@gwl.com> References: <431715AD.2080105@gwl.com> <43173BBC.9000001@gwl.com> Message-ID: On Thu, 1 Sep 2005, Drew Cullis wrote: > I ran /usr/sbin/sestatus -v and received this bit of info from the output > (there was more but I included only these lines). Would disabling SELinux > solve the problem? > > SELinux status: enabled > SELinuxfs mount: /selinux > Current mode: enforcing > Mode from config file: enforcing > Policy version: 18 > Policy from config file:targeted as root - run: audit2allow -l -i /var/log/messages -v if there are references to nagios.cmd - you have a SELinux perms problem. Either disable it, or allow permissive, or add http context to nagios/var/* -sg > > Subhendu Ghosh wrote: > >> On Thu, 1 Sep 2005, Drew Cullis wrote: >> >>> Greetings all; >>> I am in the process of upgrading our development server along with Nagios >>> and am running into a problem getting nagios to work with Apache. I am >>> running Red Hat Enterprise Server >>> which has Apache 2.0.52 by default. I found this post from last year but >>> it doesn't look like there was resolution to his problem, which is the >>> same as mine. >>> >>> https://sourceforge.net/mailarchive/message.php?msg_id=10405076 >>> >>> For your dining and dancing pleasure, here is the error that is generated >>> when you try to run an external command. >>> >>> Error: Could not stat() command file >>> '/usr/local/nagios/var/rw/nagios.cmd'! >>> The external command file may be missing, Nagios may not be running, >>> and/or Nagios may not be checking external commands. >>> An error occurred while attempting to commit your command for processing >>> >>> Nagios does work like a charm on my workstation running Apache 2.0.46, no >>> issues whatsoever straight out of the box. It looks like the issue is with >>> Apache 2.0.5 and above. >>> Has anyone run into this problem also and found a solution? Here are the >>> pertinent details. >>> >>> RH ES4 >>> Apache 2.0.52 >>> Nagios 2.0b4 >>> Nagios Plugins 1.4 >>> >>> Thanks for the help... >>> >>> -Drew >> >> >> Assuming Nagios is running and it created the named pipe... >> >> Other than Apache - do you have SELinux enabled? the external command >> would be running as apache user and have to write to the pipe you may have >> to apply http context to the file... >> > -- ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From noyler at khimetrics.com Thu Sep 1 20:11:06 2005 From: noyler at khimetrics.com (Nathan Oyler) Date: Thu, 1 Sep 2005 11:11:06 -0700 Subject: Windows Tray utility Message-ID: <59B15593F41BD24591D59436E7226EAD0268D14B@Khiphx2.khimetrics.com> Ntray on Nagios exchange works well. http://www.nagiosexchange.org/Frontends.37.0.html?&tx_netnagext_pi1[p_vi ew]=168 > -----Original Message----- > From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users- > admin at lists.sourceforge.net] On Behalf Of Johan Barelds > Sent: Thursday, September 01, 2005 10:08 AM > To: nagios-users at lists.sourceforge.net > Subject: [Nagios-users] Windows Tray utility > > Hi all, > > For BigBrother there is a very handy utily which you can install on your > windows PC in the tray. > When your monitoring page turn's into warning or critical it get's the > same > color and a box pop's up telling you what server/services are causing > problems. > > Can someone tell me if there is something for Nagios? > I checked Nagiosexchange but couldn't find anything. > > Thanks for any reply! > > -- > Kind Regards / Met vriendelijke groet, > > Johan Barelds Good-IT! > Tel.+31(0)70-3965230 Strijplaan 320 > Mob.+31(0)6-54253750 2285 HZ Rijswijk(ZH) > j.barelds at good-it.com http://www.good-it.com > > > ------------------------------------------------------- > SF.Net email is Sponsored by the Better Software Conference & EXPO > September 19-22, 2005 * San Francisco, CA * Development Lifecycle > Practices > Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA > Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ygonzales at medassets.com Thu Sep 1 20:08:08 2005 From: ygonzales at medassets.com (Gonzales, Youn) Date: Thu, 1 Sep 2005 13:08:08 -0500 Subject: statusmap icons Message-ID: <99CF04974931C548B2BF20CD898FCBA7AF4538@uscpgmedexch01.medassets.com> I am running 2.0b4 on Fedora 4. I can't seem to get the statusmap icons to work properly. define hostextinfo{ host_name uscpgls1010 icon_image network_switch.gif statusmap_image network_switch.gif } The icons show up in the status views - ie host detail - and when I float over the device on the statusmap the icons are in the top left corner of the popup. But, the icons on the status map are all the unknown.gif icons. I can browse all of the icons, so it is not an apache or permissions issue. Any suggestions? "The information transmitted is intended only for the person or entity to which it is addressed and may contain confidential, proprietary, and/or privileged material. Any review, retransmission, dissemination or other use of, or taking of any action in reliance upon this information by persons or entities other than the intended recipient is prohibited. If you received this in error, please contact the sender and delete the material from all computers" ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From al at its-lehmann.de Thu Sep 1 22:30:02 2005 From: al at its-lehmann.de (Arno Lehmann) Date: Thu, 01 Sep 2005 22:30:02 +0200 Subject: statusmap icons In-Reply-To: <99CF04974931C548B2BF20CD898FCBA7AF4538@uscpgmedexch01.medassets.com> References: <99CF04974931C548B2BF20CD898FCBA7AF4538@uscpgmedexch01.medassets.com> Message-ID: <431764CA.7000304@its-lehmann.de> Hi, Gonzales, Youn wrote: > I am running 2.0b4 on Fedora 4. I can't seem to get the statusmap icons > to work properly. > > define hostextinfo{ > host_name uscpgls1010 > icon_image network_switch.gif > statusmap_image network_switch.gif > } > > The icons show up in the status views - ie host detail - and when I > float over the device on the statusmap the icons are in the top left > corner of the popup. But, the icons on the status map are all the > unknown.gif icons. > > I can browse all of the icons, so it is not an apache or permissions > issue. Any suggestions? Yes, try using gd2 images. his worked here, I don't know what the manual has to say... Arno > "The information transmitted is intended only for the person or entity to which it is addressed and may contain confidential, proprietary, and/or privileged material. Any review, retransmission, dissemination or other use of, or taking of any action in reliance upon this information by persons or entities other than the intended recipient is prohibited. If you received this in error, please contact the sender and delete the material from all computers" > > > > > ------------------------------------------------------- > SF.Net email is Sponsored by the Better Software Conference & EXPO > September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices > Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA > Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -- IT-Service Lehmann al at its-lehmann.de Arno Lehmann http://www.its-lehmann.de ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Jason.Truong at plumtree.com Thu Sep 1 21:22:16 2005 From: Jason.Truong at plumtree.com (Jason Truong) Date: Thu, 1 Sep 2005 12:22:16 -0700 Subject: An email on acknowledge, and onlyonacknowledge? Nagios and RT Message-ID: <2B5E62F297571B49871BC8B7C587A2BEF77B87@CORPEXCH10.plumtree.com> Hello folks... We just implemented RT at our site but its on a different box than the Nagios server. Where can I find this patch to Nagios for RT. The site I am looking at: http://archives.free.net.ph/message/20050613.151813.d26c3cb0.en.html Only works if Nagios and RT are on the same machine. Any help is appreciated. Thank you, Jason Truong Plumtree Software 500 Sansome Street Suite 100 San Francisco, CA 94111 email: jason.truong at plumtree.com phone: (415) 399-7006 -----Original Message----- From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users-admin at lists.sourceforge.net] On Behalf Of Chris Wilson Sent: Tuesday, July 26, 2005 8:20 AM To: Nathan Oyler Cc: Nagios Users Subject: RE: [Nagios-users] An email on acknowledge, and onlyonacknowledge? Nagios and RT Hi Nathan, > Because if when there's a problem, a notification is sent. A ticket is > opened. > > Now an hour goes by, and the problem still exists, so another > notification is sent. Another ticket is opened. > > Now a recovery message is sent. Another ticket is opened. It's easy enough to write your handler so that no email is sent unless the message type is ACKNOWLEDGEMENT. > There's already a patch to Nagios to call the RT api during > acknowledgements, but my RT machine isn't my Nagios machine, and I don't > know enough of anything other than perl to edit it to do as I wish. perl -i -pe 's/localhost/my.rt.server/g' api-script.c :-) > There's also writing handlers to merge tickets when they come in, if new > alert comes in, search for if a ticket is created for this host, grab > that ticket number, send the new alert to comment on the ticket. This is much trickier. Basically you need to write a mail handler that gets the responses from RT, captures the ticket number, and associates it in some way with a permanent record of the problem (maybe hostname + service name). It's been discussed in the past, you could check the archives. But it sounds like more than you need. > This is how I'm leaning if I find nothing else out, but I'm thinking > just on an acknowledge if a ticket was open, that would make sense. Cheers, Chris. -- (aidworld) chris wilson | chief engineer (chris at aidworld.org) ------------------------------------------------------- SF.Net email is sponsored by: Discover Easy Linux Migration Strategies from IBM. Find simple to follow Roadmaps, straightforward articles, informative Webcasts and more! Get everything you need to get up to speed, fast. http://ads.osdn.com/?ad_id=7477&alloc_id=16492&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Praveenm at niit.com Thu Sep 1 22:54:10 2005 From: Praveenm at niit.com (Praveen Muthyala Manohar) Date: Thu, 1 Sep 2005 16:54:10 -0400 Subject: Nagios - J2EE server Message-ID: Hi All, Is there a plug-in available to check the health of application server (Web Logic or any App Server) instance? Regards Praveen M M ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ygonzales at medassets.com Thu Sep 1 23:06:02 2005 From: ygonzales at medassets.com (Gonzales, Youn) Date: Thu, 1 Sep 2005 16:06:02 -0500 Subject: statusmap icons Message-ID: <99CF04974931C548B2BF20CD898FCBA7AF4631@uscpgmedexch01.medassets.com> I am not able to view gd2 images in internet explorer. Is there a plugin or something I need to install? -----Original Message----- From: Arno Lehmann [mailto:al at its-lehmann.de] Sent: Thursday, September 01, 2005 3:30 PM To: Gonzales, Youn Cc: NAGIOS Subject: Re: [Nagios-users] statusmap icons Hi, Gonzales, Youn wrote: > I am running 2.0b4 on Fedora 4. I can't seem to get the statusmap icons > to work properly. > > define hostextinfo{ > host_name uscpgls1010 > icon_image network_switch.gif > statusmap_image network_switch.gif > } > > The icons show up in the status views - ie host detail - and when I > float over the device on the statusmap the icons are in the top left > corner of the popup. But, the icons on the status map are all the > unknown.gif icons. > > I can browse all of the icons, so it is not an apache or permissions > issue. Any suggestions? Yes, try using gd2 images. his worked here, I don't know what the manual has to say... Arno > "The information transmitted is intended only for the person or entity to which it is addressed and may contain confidential, proprietary, and/or privileged material. Any review, retransmission, dissemination or other use of, or taking of any action in reliance upon this information by persons or entities other than the intended recipient is prohibited. If you received this in error, please contact the sender and delete the material from all computers" > > > > > ------------------------------------------------------- > SF.Net email is Sponsored by the Better Software Conference & EXPO > September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices > Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA > Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -- IT-Service Lehmann al at its-lehmann.de Arno Lehmann http://www.its-lehmann.de "The information transmitted is intended only for the person or entity to which it is addressed and may contain confidential, proprietary, and/or privileged material. Any review, retransmission, dissemination or other use of, or taking of any action in reliance upon this information by persons or entities other than the intended recipient is prohibited. If you received this in error, please contact the sender and delete the material from all computers" ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ygonzales at medassets.com Thu Sep 1 23:31:27 2005 From: ygonzales at medassets.com (Gonzales, Youn) Date: Thu, 1 Sep 2005 16:31:27 -0500 Subject: statusmap icons Message-ID: <99CF04974931C548B2BF20CD898FCBA7AF4643@uscpgmedexch01.medassets.com> The GD2 images work on the statusmap, but do not work on the popup, so it looks like I can either have the image on the map or the image on the popup, but not both. -----Original Message----- From: Arno Lehmann [mailto:al at its-lehmann.de] Sent: Thursday, September 01, 2005 4:23 PM To: Gonzales, Youn Cc: NAGIOS Subject: Re: [Nagios-users] statusmap icons Hi, Gonzales, Youn wrote: > I am not able to view gd2 images in internet explorer. Is there a plugin > or something I need to install? No, you don't view them directly. They are used by libgd to create the statusmap image. You can simply convert one of your gifs to a gd2 format image, modify the configuration concerning hostextinfo to use the gd image, and see what happens. See the manual secion on extended information configuration for details, but note that my experience was that only gd images worked. I never tried to determine if that was a configuration error, an incorrect manual description, or anything else. With Nagios 1.2 and 2.0b3, by the way. Arno > -----Original Message----- > From: Arno Lehmann [mailto:al at its-lehmann.de] > Sent: Thursday, September 01, 2005 3:30 PM > To: Gonzales, Youn > Cc: NAGIOS > Subject: Re: [Nagios-users] statusmap icons > > Hi, > > Gonzales, Youn wrote: > > >>I am running 2.0b4 on Fedora 4. I can't seem to get the statusmap > > icons > >>to work properly. >> >>define hostextinfo{ >> host_name uscpgls1010 >> icon_image network_switch.gif >> statusmap_image network_switch.gif >> } >> >>The icons show up in the status views - ie host detail - and when I >>float over the device on the statusmap the icons are in the top left >>corner of the popup. But, the icons on the status map are all the >>unknown.gif icons. >> >>I can browse all of the icons, so it is not an apache or permissions >>issue. Any suggestions? > > > Yes, try using gd2 images. his worked here, I don't know what the manual > > has to say... > > Arno > > >>"The information transmitted is intended only for the person or entity > > to which it is addressed and may contain confidential, proprietary, > and/or privileged material. Any review, retransmission, dissemination or > other use of, or taking of any action in reliance upon this information > by persons or entities other than the intended recipient is prohibited. > If you received this in error, please contact the sender and delete the > material from all computers" > >> >> >> >>------------------------------------------------------- >>SF.Net email is Sponsored by the Better Software Conference & EXPO >>September 19-22, 2005 * San Francisco, CA * Development Lifecycle > > Practices > >>Agile & Plan-Driven Development * Managing Projects & Teams * Testing > > & QA > >>Security * Process Improvement & Measurement * > > http://www.sqe.com/bsce5sf > >>_______________________________________________ >>Nagios-users mailing list >>Nagios-users at lists.sourceforge.net >>https://lists.sourceforge.net/lists/listinfo/nagios-users >>::: Please include Nagios version, plugin version (-v) and OS when > > reporting any issue. > >>::: Messages without supporting info will risk being sent to /dev/null >> > > -- IT-Service Lehmann al at its-lehmann.de Arno Lehmann http://www.its-lehmann.de "The information transmitted is intended only for the person or entity to which it is addressed and may contain confidential, proprietary, and/or privileged material. Any review, retransmission, dissemination or other use of, or taking of any action in reliance upon this information by persons or entities other than the intended recipient is prohibited. If you received this in error, please contact the sender and delete the material from all computers" ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From adisharon at gmail.com Thu Sep 1 22:07:51 2005 From: adisharon at gmail.com (Adi Sharon) Date: Thu, 1 Sep 2005 20:07:51 +0000 (UTC) Subject: Service Checks in pending state - Take 2 References: Message-ID: Hello well it happened again. added some services and restarted nagios( one proc is running) and again all services pending and scheduled for tommorow nite in 00:00. the strange thing i also notice is that the status information does not contain the same time for all services. since there are only 10 services and concurrent checks is now 150 seems a little strange. but anyway nagios is not running any more only in pending state. i cleared the mysql tables ( all of them) but it didnt helped still all service checks are scheduled for tommorow. the nagios configuration + information is the same as i wrote. only added 2 more services. Please Help Adi ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From sghosh at sghosh.org Thu Sep 1 23:52:17 2005 From: sghosh at sghosh.org (Subhendu Ghosh) Date: Thu, 1 Sep 2005 17:52:17 -0400 (EDT) Subject: statusmap icons In-Reply-To: <99CF04974931C548B2BF20CD898FCBA7AF4643@uscpgmedexch01.medassets.com> References: <99CF04974931C548B2BF20CD898FCBA7AF4643@uscpgmedexch01.medassets.com> Message-ID: On Thu, 1 Sep 2005, Gonzales, Youn wrote: > The GD2 images work on the statusmap, but do not work on the popup, so > it looks like I can either have the image on the map or the image on the > popup, but not both. >From docs: http://nagios.sourceforge.net/docs/2_0/xodtemplate.html#hostextinfo define hostextinfo{ host_name host_name notes note_string notes_url url action_url url icon_image image_file icon_image_alt alt_string vrml_image image_file statusmap_image image_file 2d_coords x_coord,y_coord 3d_coords x_coord,y_coord,z_coord } Note: icon_image/vrml_image - should be png/gif/jpg statusmap_image - should be gd2 > > -----Original Message----- > From: Arno Lehmann [mailto:al at its-lehmann.de] > Sent: Thursday, September 01, 2005 4:23 PM > To: Gonzales, Youn > Cc: NAGIOS > Subject: Re: [Nagios-users] statusmap icons > > Hi, > > Gonzales, Youn wrote: > >> I am not able to view gd2 images in internet explorer. Is there a > plugin >> or something I need to install? > > No, you don't view them directly. They are used by libgd to create the > statusmap image. > > You can simply convert one of your gifs to a gd2 format image, modify > the configuration concerning hostextinfo to use the gd image, and see > what happens. See the manual secion on extended information > configuration for details, but note that my experience was that only gd > images worked. I never tried to determine if that was a configuration > error, an incorrect manual description, or anything else. > > With Nagios 1.2 and 2.0b3, by the way. > > Arno > >> -----Original Message----- >> From: Arno Lehmann [mailto:al at its-lehmann.de] >> Sent: Thursday, September 01, 2005 3:30 PM >> To: Gonzales, Youn >> Cc: NAGIOS >> Subject: Re: [Nagios-users] statusmap icons >> >> Hi, >> >> Gonzales, Youn wrote: >> >> >>> I am running 2.0b4 on Fedora 4. I can't seem to get the statusmap >> >> icons >> >>> to work properly. >>> >>> define hostextinfo{ >>> host_name uscpgls1010 >>> icon_image network_switch.gif >>> statusmap_image network_switch.gif >>> } >>> >>> The icons show up in the status views - ie host detail - and when I >>> float over the device on the statusmap the icons are in the top left >>> corner of the popup. But, the icons on the status map are all the >>> unknown.gif icons. >>> >>> I can browse all of the icons, so it is not an apache or permissions >>> issue. Any suggestions? >> >> >> Yes, try using gd2 images. his worked here, I don't know what the > manual >> >> has to say... >> >> Arno >> -- ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From sghosh at sghosh.org Thu Sep 1 23:15:56 2005 From: sghosh at sghosh.org (Subhendu Ghosh) Date: Thu, 1 Sep 2005 17:15:56 -0400 (EDT) Subject: statusmap icons In-Reply-To: <99CF04974931C548B2BF20CD898FCBA7AF4631@uscpgmedexch01.medassets.com> References: <99CF04974931C548B2BF20CD898FCBA7AF4631@uscpgmedexch01.medassets.com> Message-ID: On Thu, 1 Sep 2005, Gonzales, Youn wrote: > I am not able to view gd2 images in internet explorer. Is there a plugin > or something I need to install? > Nagios uses the gd2 images to generate a larger image - in a png format. gd2 images are not directly sent out. -sg > -----Original Message----- > From: Arno Lehmann [mailto:al at its-lehmann.de] > Sent: Thursday, September 01, 2005 3:30 PM > To: Gonzales, Youn > Cc: NAGIOS > Subject: Re: [Nagios-users] statusmap icons > > Hi, > > Gonzales, Youn wrote: > >> I am running 2.0b4 on Fedora 4. I can't seem to get the statusmap > icons >> to work properly. >> >> define hostextinfo{ >> host_name uscpgls1010 >> icon_image network_switch.gif >> statusmap_image network_switch.gif >> } >> >> The icons show up in the status views - ie host detail - and when I >> float over the device on the statusmap the icons are in the top left >> corner of the popup. But, the icons on the status map are all the >> unknown.gif icons. >> >> I can browse all of the icons, so it is not an apache or permissions >> issue. Any suggestions? > > Yes, try using gd2 images. his worked here, I don't know what the manual > > has to say... > > Arno > -- ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From al at its-lehmann.de Thu Sep 1 23:23:21 2005 From: al at its-lehmann.de (Arno Lehmann) Date: Thu, 01 Sep 2005 23:23:21 +0200 Subject: statusmap icons In-Reply-To: <99CF04974931C548B2BF20CD898FCBA7AF4631@uscpgmedexch01.medassets.com> References: <99CF04974931C548B2BF20CD898FCBA7AF4631@uscpgmedexch01.medassets.com> Message-ID: <43177149.2030506@its-lehmann.de> Hi, Gonzales, Youn wrote: > I am not able to view gd2 images in internet explorer. Is there a plugin > or something I need to install? No, you don't view them directly. They are used by libgd to create the statusmap image. You can simply convert one of your gifs to a gd2 format image, modify the configuration concerning hostextinfo to use the gd image, and see what happens. See the manual secion on extended information configuration for details, but note that my experience was that only gd images worked. I never tried to determine if that was a configuration error, an incorrect manual description, or anything else. With Nagios 1.2 and 2.0b3, by the way. Arno > -----Original Message----- > From: Arno Lehmann [mailto:al at its-lehmann.de] > Sent: Thursday, September 01, 2005 3:30 PM > To: Gonzales, Youn > Cc: NAGIOS > Subject: Re: [Nagios-users] statusmap icons > > Hi, > > Gonzales, Youn wrote: > > >>I am running 2.0b4 on Fedora 4. I can't seem to get the statusmap > > icons > >>to work properly. >> >>define hostextinfo{ >> host_name uscpgls1010 >> icon_image network_switch.gif >> statusmap_image network_switch.gif >> } >> >>The icons show up in the status views - ie host detail - and when I >>float over the device on the statusmap the icons are in the top left >>corner of the popup. But, the icons on the status map are all the >>unknown.gif icons. >> >>I can browse all of the icons, so it is not an apache or permissions >>issue. Any suggestions? > > > Yes, try using gd2 images. his worked here, I don't know what the manual > > has to say... > > Arno > > >>"The information transmitted is intended only for the person or entity > > to which it is addressed and may contain confidential, proprietary, > and/or privileged material. Any review, retransmission, dissemination or > other use of, or taking of any action in reliance upon this information > by persons or entities other than the intended recipient is prohibited. > If you received this in error, please contact the sender and delete the > material from all computers" > >> >> >> >>------------------------------------------------------- >>SF.Net email is Sponsored by the Better Software Conference & EXPO >>September 19-22, 2005 * San Francisco, CA * Development Lifecycle > > Practices > >>Agile & Plan-Driven Development * Managing Projects & Teams * Testing > > & QA > >>Security * Process Improvement & Measurement * > > http://www.sqe.com/bsce5sf > >>_______________________________________________ >>Nagios-users mailing list >>Nagios-users at lists.sourceforge.net >>https://lists.sourceforge.net/lists/listinfo/nagios-users >>::: Please include Nagios version, plugin version (-v) and OS when > > reporting any issue. > >>::: Messages without supporting info will risk being sent to /dev/null >> > > -- IT-Service Lehmann al at its-lehmann.de Arno Lehmann http://www.its-lehmann.de ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From elizar.palad at gmail.com Fri Sep 2 01:14:43 2005 From: elizar.palad at gmail.com (Elizar M. Palad) Date: Fri, 2 Sep 2005 07:14:43 +0800 Subject: @sysconfdir@/cgi.cfg - Installation setup problem In-Reply-To: References: Message-ID: Hi, Actually, in /usr/local/nagios/etc there are now some *sample files. the resource.cfg that i do use has the @libexec@ and the resource.cfg-sample file has the one set in /usr/local/nagios/libexec, so used that one. When i rerun nagios, i got the Return code of 137 is out of bounds in the webpage and ld.so.1: /usr/local/nagios/libexec/check_ping: fatal: libiconv.so.2: open failed: No such file or directory ing the terminal.. although i have the libiconv.so.2 file in /usr/local/lib I read somewhere that i have to set some variable to the path of the library but i forgot where/what it is? LIB_... something.. :) Thanks! On 9/1/05, Subhendu Ghosh wrote: > > configure looks good here. > > In /usr/local/nagios/etc - do you only have one resource.cfg file? or do > you have resource.cfg.in as well? > > you might want to delete every thing in /usr/local/nagios/etc and run > "make install-config" again to get the base sample-config > > -sg > > On Thu, 1 Sep 2005, Elizar M. Palad wrote: > > > Hi Subhendu, > > I rerun the ./configure... same options and i couldn't find any critical > > errors. > > I attached the output produced. The only thing that looks like an error > is > > the gd, which, according to the documentation, nagios will run without > it. > > > > # ./configure --prefix=/usr/local/nagios --with-cgiurl=/nagios/cgi-bin > > --with-htmlurl=/nagios --with-nagios-user=na > > gios --with-nagios-groupp=nagios --with-gd-lib=/usr/local/lib > > checking for a BSD compatible install... ./install-sh -c > > checking host system type... sparc-sun-solaris2.6 > > checking for gcc... gcc > > checking whether the C compiler (gcc ) works... yes > > checking whether the C compiler (gcc ) is a cross-compiler... no > > checking whether we are using GNU C... yes > > checking whether gcc accepts -g... yes > > checking whether make sets ${MAKE}... yes > > checking for strip... /usr/ccs/bin/strip > > checking how to run the C preprocessor... gcc -E > > checking for ANSI C header files... yes > > checking whether time.h and sys/time.h may both be included... yes > > checking for sys/wait.h that is POSIX.1 compatible... yes > > checking for arpa/inet.h... yes > > checking for ctype.h... yes > > checking for dirent.h... yes > > checking for errno.h... yes > > checking for fcntl.h... yes > > checking for getopt.h... no > > checking for grp.h... yes > > checking for limits.h... yes > > checking for math.h... yes > > checking for netdb.h... yes > > checking for netinet/in.h... yes > > checking for pthread.h... yes > > checking for pthreads.h... no > > checking for pwd.h... yes > > checking for regex.h... yes > > checking for signal.h... yes > > checking for socket.h... no > > checking for string.h... yes > > checking for strings.h... yes > > checking for sys/mman.h... yes > > checking for sys/types.h... yes > > checking for sys/time.h... yes > > checking for sys/resource.h... yes > > checking for sys/wait.h... (cached) yes > > checking for sys/socket.h... yes > > checking for sys/stat.h... yes > > checking for sys/timeb.h... yes > > checking for sys/un.h... yes > > checking for sys/ipc.h... yes > > checking for sys/msg.h... yes > > checking for sys/poll.h... yes > > checking for syslog.h... yes > > checking for uio.h... no > > checking for unistd.h... yes > > checking for working const... yes > > checking whether struct tm is in sys/time.h or time.h... time.h > > checking for tm_zone in struct tm... no > > checking for tzname... yes > > checking for mode_t... yes > > checking for pid_t... yes > > checking for size_t... yes > > checking return type of signal handlers... void > > checking for uid_t in sys/types.h... yes > > checking type of array argument to getgroups... gid_t > > checking for initgroups... yes > > checking for setenv... no > > checking for strdup... yes > > checking for strstr... yes > > checking for strtoul... yes > > checking for unsetenv... no > > checking for type of socket size... size_t > > checking for pthread_create in -lcma... no > > checking for pthread_create in -lpthread... yes > > checking for library containing nanosleep... -lposix4 > > checking for mail... /usr/bin/mail > > Init script directory: /etc/init.d > > We'll use default routines (in xdata/xsddefault.*) for status data > I/O... > > We'll use default routines (in xdata/xcddefault.*) for comment data > I/O... > > We'll use default routines (in xdata/xrddefault.*) for retention data > I/O... > > We'll use template-based routines (in xdata/xodtemplate.*) for object > data > > I/O... > > We'll use default routines (in xdata/xpddefault.*) for performance data > > I/O... > > We'll use default routines (in xdata/xdddefault.*) for scheduled > downtime > > data I/O... > > checking for main in -liconv... yes > > checking for gdImagePng in -lgd (order 1)... no > > checking for gdImagePng in -lgd (order 2)... no > > checking for gdImagePng in -lgd (order 3)... no > > checking for gdImagePng in -lgd (order 4)... no > > > > > > *** GD, PNG, and/or JPEG libraries could not be located... ********* > > > > Boutell's GD library is required to compile the statusmap, trends > > and histogram CGIs. Get it from http://www.boutell.com/gd/, compile > > it, and use the --with-gd-lib and --with-gd-inc arguments to specify > > the locations of the GD library and include files. > > > > NOTE: In addition to the gd-devel library, you'll also need to make > > sure you have the png-devel and jpeg-devel libraries installed > > on your system. > > > > NOTE: After you install the necessary libraries on your system: > > 1. Make sure /etc/ld.so.conf has an entry for the directory in > > which the GD, PNG, and JPEG libraries are installed. > > 2. Run 'ldconfig' to update the run-time linker options. > > 3. Run 'make clean' in the Nagios distribution to clean out > > any old references to your previous compile. > > 4. Rerun the configure script. > > > > NOTE: If you can't get the configure script to recognize the GD libs > > on your system, get over it and move on to other things. The > > CGIs that use the GD libs are just a small part of the entire > > Nagios package. Get everything else working first and then > > revisit the problem. Make sure to check the nagios-users > > mailing list archives for possible solutions to GD library > > problems when you resume your troubleshooting. > > > > ******************************************************************** > > > > > > checking for ltdl.h... no > > checking for dlfcn.h... yes > > checking for dlopen in -ldl... yes > > checking for extra flags needed to export symbols... none > > checking for linker flags for loadable modules... -G > > checking for linker flags for loadable modules... -G > > checking for traceroute... no > > checking for snprintf... yes > > checking for type va_list... yes > > checking for perl... /usr/local/bin/perl > > creating ./config.status > > creating Makefile > > creating subst > > creating pkginfo > > creating base/Makefile > > creating common/Makefile > > creating contrib/Makefile > > creating cgi/Makefile > > creating html/Makefile > > creating module/Makefile > > creating include/Makefile > > creating xdata/Makefile > > creating daemon-init > > creating html/index.html > > creating html/side.html > > creating include/config.h > > include/config.h is unchanged > > creating include/snprintf.h > > include/snprintf.h is unchanged > > creating include/nagios.h > > creating include/cgiutils.h > > include/cgiutils.h is unchanged > > > > Creating sample config files in sample-config/ ... > > > > > > *** Configuration summary for nagios 2.0b4 08-02-2005 ***: > > > > General Options: > > ------------------------- > > Nagios executable: nagios > > Nagios user/group: nagios,nagios > > Command user/group: nagios,nagios > > Embedded Perl: no > > Event Broker: yes > > Install ${prefix}: /usr/local/nagios > > Lock file: ${prefix}/var/nagios.lock > > Init directory: /etc/init.d > > Host OS: solaris2.6 > > > > Web Interface Options: > > ------------------------ > > HTML URL: http://localhost/nagios/ > > CGI URL: http://localhost/nagios/cgi-bin/ > > Traceroute (used by WAP): > > > > > > Review the options above for accuracy. If they look okay, > > type 'make all' to compile the main program and CGIs. > > > > > > On 9/1/05, Subhendu Ghosh wrote: > >> > >> On Thu, 1 Sep 2005, Elizar M. Palad wrote: > >> > >>> Hi! Ok, i rerun configure script after installed perl, seems ok. no > >> error > >>> and all > >>> the .h files are created and there's no more error in make > >> install-config (i > >>> had one before) > >>> i am using the minimal.cfg to test nagios. > >>> I started nagios without any options. > >>> /usr/local/nagios/bin/nagios /usr/local/nagios/etc/nagios.cfg > >>> and it says: > >>> sh: @libexecdir@/check_ping: not found > >>> sh: @libexecdir@/check_users: not found > >>> sh: @libexecdir@/check_procs: not found > >>> Is this ok? > >> > >> No - @libexecdir@ should have been fixed by configure. > >> > >> For nagios - there should be no errors in the configure process... > >> > >> > >>> The IE page now has an error... > >>> It appears as though you do not have permission to view information > for > >>> any of the hosts you requested... > >>> > >>> If you believe this is an error, check the HTTP server authentication > >>> requirements for accessing this CGI > >>> and check the authorization options in your CGI configuration file. > >>> I didn't set up any authentication for nagios. Maybe it caused the > error > >>> above? > >>> if it does, how to use nagios without authentication? > >>> thanks! > >>> > >>> > >>> On 8/31/05, Subhendu Ghosh wrote: > >>>> > >>>> > >>>> Yes - autoconf which creates configure uses perl :) > >>>> > >>>> You can either compile your own or sunfreeware.com > > >> has some nice pkgadd > >>>> versions. > >>>> > >>>> -sg > >>>> > >>>> > >>>> On Wed, 31 Aug 2005, Elizar M. Palad wrote: > >>>> > >>>>> Hi, thanks! > >>>>> from ./configure's output: > >>>>> > >>>>> creating html/index.html > >>>>> creating html/side.html > >>>>> creating include/config.h > >>>>> creating include/snprintf.h > >>>>> creating include/nagios.h > >>>>> creating include/cgiutils.h > >>>>> ./configure: perl: not found > >>>>> ./configure: perl: not found > >>>>> > >>>>> Creating sample config files in sample-config/ ... > >>>>> ./configure: perl: not found > >>>>> ./configure: perl: not found > >>>>> ./configure: perl: not found > >>>>> ./configure: perl: not found > >>>>> ./configure: perl: not found > >>>>> ./configure: perl: not found > >>>>> ./configure: perl: not found > >>>>> ./configure: perl: not found > >>>>> > >>>>> > >>>>> *** Configuration summary for nagios 2.0b4 08-02-2005 ***: > >>>>> > >>>>> General Options: > >>>>> ------------------------- > >>>>> Nagios executable: nagios > >>>>> Nagios user/group: nagios,nagios > >>>>> Command user/group: nagios,nagios > >>>>> Embedded Perl: no > >>>>> Event Broker: yes > >>>>> Install ${prefix}: /usr/local/nagios > >>>>> Lock file: ${prefix}/var/nagios.lock > >>>>> Init directory: /etc/init.d > >>>>> Host OS: solaris2.6 > >>>>> .... > >>>>> Im going to install now perl and see if it will create the > >> location.hfile. > >>>>> Thanks! > >>>>> eli > >>>>> > >>>>> On 8/31/05, Subhendu Ghosh wrote: > >>>>>> > >>>>>> On Wed, 31 Aug 2005, Elizar M. Palad wrote: > >>>>>> > >>>>>>> I rerun ./configure --prefix=/usr/local/nagios > >>>>>> --with-cgiurl=/nagios/cgi-bin > >>>>>>> --with-htmlurl=/nagios --with-nagios-user=nagios > >>>>>> --with-nagios-groupp=nagios > >>>>>>> --with-gd-lib=/usr/local/lib > >>>>>>> and summary is below: > >>>>>>> *** Configuration summary for nagios 2.0b4 08-02-2005 ***: > >>>>>>> > >>>>>>> General Options: > >>>>>>> ------------------------- > >>>>>>> Nagios executable: nagios > >>>>>>> Nagios user/group: nagios,nagios > >>>>>>> Command user/group: nagios,nagios > >>>>>>> Embedded Perl: no > >>>>>>> Event Broker: yes > >>>>>>> Install ${prefix}: /usr/local/nagios > >>>>>>> Lock file: ${prefix}/var/nagios.lock > >>>>>>> Init directory: /etc/init.d > >>>>>>> Host OS: solaris2.6 > >>>>>>> > >>>>>>> Web Interface Options: > >>>>>>> ------------------------ > >>>>>>> HTML URL: http://localhost/nagios/ > >>>>>>> CGI URL: http://localhost/nagios/cgi-bin/ > >>>>>>> Traceroute (used by WAP): > >>>>>>> > >>>>>>> > >>>>>>> Review the options above for accuracy. If they look okay, > >>>>>>> type 'make all' to compile the main program and CGIs. > >>>>>>> ------------------ > >>>>>>> Looks ok isn't it? > >>>>>>> then when i do 'make all', i have this error: > >>>>>>> # make all > >>>>>>> cd ./base && make > >>>>>>> make: Fatal error: Don't know how to make target > >>>>>> `../include/locations.h' > >>>>>>> Current working directory /packages/nagios-2.0b4/base > >>>>>>> *** Error code 1 > >>>>>>> make: Fatal error: Command failed for target `all' > >>>>>>> This is also my second time with this error. What I did was i copy > >>>>>>> locations.h.in < > http://locations.h.in> < > >> http://locations.h.in> < > >>>> http://locations.h.in> as > >>>>>> locations.h :-) > >>>>>>> and the compilation continues.. (was that not right? :) > >>>>>>> thanks! > >>>>>> > >>>>>> > >>>>>> NOOOOO - configure must have produced an error before the summary > was > >>>>>> printed. > >>>>>> > >>>>>> configure should create a locations.h from locations.h.in > > >> > >>>> after > >>>>>> appropriate substitutions. > >>>>>> > >>>>>> If the file is not being created properly - you may have a file > >>>> permission > >>>>>> issue. > >>>>>> > >>>>>> run "chown -R user:group nagios_src_dir/" > >>>>>> > >>>>>> replace the user and group with the user and group info fort he > >> person > >>>>>> running configure > >>>>>> > >>>>>> replace nagios_src_dir with the dir that contains the nagios src. > >>>>>> > >>>>>> Never copy a *.*.in file into a *.* file. > >>>>>> > >>>>>> > >>>>>> > >>>>>> -- > >>>>>> > >>>>>> -sg > >>>>>> > >>>>> > >>>>> > >>>>> > >>>>> > >>>> > >>>> -- > >>>> > >>>> > >>> > >>> > >>> > >> > >> -- > >> > >> > >> > >> ------------------------------------------------------- > >> SF.Net email is Sponsored by the Better Software Conference & EXPO > >> September 19-22, 2005 * San Francisco, CA * Development Lifecycle > >> Practices > >> Agile & Plan-Driven Development * Managing Projects & Teams * Testing & > QA > >> Security * Process Improvement & Measurement * > http://www.sqe.com/bsce5sf > >> _______________________________________________ > >> Nagios-users mailing list > >> Nagios-users at lists.sourceforge.net > >> https://lists.sourceforge.net/lists/listinfo/nagios-users > >> ::: Please include Nagios version, plugin version (-v) and OS when > >> reporting any issue. > >> ::: Messages without supporting info will risk being sent to /dev/null > >> > > > > > > > > > > -- > > -- ---- Don't Tell Me How Hard You Work.. Show Me How Much You'd Accomplished.. -------------- next part -------------- An HTML attachment was scrubbed... URL: From elizar.palad at gmail.com Fri Sep 2 01:46:48 2005 From: elizar.palad at gmail.com (Elizar M. Palad) Date: Fri, 2 Sep 2005 07:46:48 +0800 Subject: @sysconfdir@/cgi.cfg - Installation setup problem In-Reply-To: References: Message-ID: Hi Subhendu! Found it! :-) http://www.nagios.org/faqs/viewfaq.php?faq_id=17 Will continue to learn nagios! Thanks for the help! and you guys will hear from me soon, i bet. ;) Thanks! eli On 9/2/05, Elizar M. Palad wrote: > > Hi, > Actually, in /usr/local/nagios/etc there are now some *sample files. > the resource.cfg that i do use has the @libexec@ and the > resource.cfg-sample > file has the one set in /usr/local/nagios/libexec, so used that one. > When i rerun nagios, i got the > Return code of 137 is out of bounds > in the webpage and > ld.so.1: /usr/local/nagios/libexec/check_ping: fatal: libiconv.so.2: open > failed: No such file or directory > ing the terminal.. > although i have the libiconv.so.2 file in /usr/local/lib > I read somewhere that i have to set some variable to the path > of the library but i forgot where/what it is? LIB_... something.. :) > Thanks! > > On 9/1/05, Subhendu Ghosh wrote: > > > > configure looks good here. > > > > In /usr/local/nagios/etc - do you only have one resource.cfg file? or do > > > > you have resource.cfg.in as well? > > > > you might want to delete every thing in /usr/local/nagios/etc and run > > "make install-config" again to get the base sample-config > > > > -sg > > > > On Thu, 1 Sep 2005, Elizar M. Palad wrote: > > > > > Hi Subhendu, > > > I rerun the ./configure... same options and i couldn't find any > > critical > > > errors. > > > I attached the output produced. The only thing that looks like an > > error is > > > the gd, which, according to the documentation, nagios will run without > > it. > > > > > > # ./configure --prefix=/usr/local/nagios --with-cgiurl=/nagios/cgi-bin > > > --with-htmlurl=/nagios --with-nagios-user=na > > > gios --with-nagios-groupp=nagios --with-gd-lib=/usr/local/lib > > > checking for a BSD compatible install... ./install-sh -c > > > checking host system type... sparc-sun-solaris2.6 > > > checking for gcc... gcc > > > checking whether the C compiler (gcc ) works... yes > > > checking whether the C compiler (gcc ) is a cross-compiler... no > > > checking whether we are using GNU C... yes > > > checking whether gcc accepts -g... yes > > > checking whether make sets ${MAKE}... yes > > > checking for strip... /usr/ccs/bin/strip > > > checking how to run the C preprocessor... gcc -E > > > checking for ANSI C header files... yes > > > checking whether time.h and sys/time.h may both be included... yes > > > checking for sys/wait.h that is POSIX.1 compatible... yes > > > checking for arpa/inet.h... yes > > > checking for ctype.h... yes > > > checking for dirent.h.. . yes > > > checking for errno.h... yes > > > checking for fcntl.h... yes > > > checking for getopt.h... no > > > checking for grp.h... yes > > > checking for limits.h... yes > > > checking for math.h... yes > > > checking for netdb.h... yes > > > checking for netinet/in.h... yes > > > checking for pthread.h... yes > > > checking for pthreads.h... no > > > checking for pwd.h... yes > > > checking for regex.h... yes > > > checking for signal.h... yes > > > checking for socket.h... no > > > checking for string.h... yes > > > checking for strings.h... yes > > > checking for sys/mman.h... yes > > > checking for sys/types.h... yes > > > checking for sys/time.h... yes > > > checking for sys/resource.h... yes > > > checking for sys/wait.h... (cached) yes > > > checking for sys/socket.h... yes > > > checking for sys/stat.h... yes > > > checking for sys/timeb.h... yes > > > checking for sys/un.h... yes > > > checking for sys/ipc.h... yes > > > checking for sys/msg.h... yes > > > checking for sys/poll.h... yes > > > checking for syslog.h... yes > > > checking for uio.h... no > > > checking for unistd.h... yes > > > checking for working const... yes > > > checking whether struct tm is in sys/time.h or time.h... time.h > > > checking for tm_zone in struct tm... no > > > checking for tzname... yes > > > checking for mode_t... yes > > > checking for pid_t... yes > > > checking for size_t... yes > > > checking return type of signal handlers... void > > > checking for uid_t in sys/types.h... yes > > > checking type of array argument to getgroups... gid_t > > > checking for initgroups... yes > > > checking for setenv... no > > > checking for strdup... yes > > > checking for strstr... yes > > > checking for strtoul... yes > > > checking for unsetenv... no > > > checking for type of socket size... size_t > > > checking for pthread_create in -lcma... no > > > checking for pthread_create in -lpthread... yes > > > checking for library containing nanosleep... -lposix4 > > > checking for mail... /usr/bin/mail > > > Init script directory: /etc/init.d > > > We'll use default routines (in xdata/xsddefault.*) for status data > > I/O... > > > We'll use default routines (in xdata/xcddefault.*) for comment data > > I/O... > > > We'll use default routines (in xdata/xrddefault.*) for retention data > > I/O... > > > We'll use template-based routines (in xdata/xodtemplate.*) for object > > data > > > I/O... > > > We'll use default routines (in xdata/xpddefault.*) for performance > > data > > > I/O... > > > We'll use default routines (in xdata/xdddefault.*) for scheduled > > downtime > > > data I/O... > > > checking for main in -liconv... yes > > > checking for gdImagePng in -lgd (order 1)... no > > > checking for gdImagePng in -lgd (order 2)... no > > > checking for gdImagePng in -lgd (order 3)... no > > > checking for gdImagePng in -lgd (order 4)... no > > > > > > > > > *** GD, PNG, and/or JPEG libraries could not be located... ********* > > > > > > Boutell's GD library is required to compile the statusmap, trends > > > and histogram CGIs. Get it from http://www.boutell.com/gd/, compile > > > it, and use the --with-gd-lib and --with-gd-inc arguments to specify > > > the locations of the GD library and include files. > > > > > > NOTE: In addition to the gd-devel library, you'll also need to make > > > sure you have the png-devel and jpeg-devel libraries installed > > > on your system. > > > > > > NOTE: After you install the necessary libraries on your system: > > > 1. Make sure /etc/ld.so.conf has an entry for the directory in > > > which the GD, PNG, and JPEG libraries are installed. > > > 2. Run 'ldconfig' to update the run-time linker options. > > > 3. Run 'make clean' in the Nagios distribution to clean out > > > any old references to your previous compile. > > > 4. Rerun the configure script. > > > > > > NOTE: If you can't get the configure script to recognize the GD libs > > > on your system, get over it and move on to other things. The > > > CGIs that use the GD libs are just a small part of the entire > > > Nagios package. Get everything else working first and then > > > revisit the problem. Make sure to check the nagios-users > > > mailing list archives for possible solutions to GD library > > > problems when you resume your troubleshooting. > > > > > > ******************************************************************** > > > > > > > > > checking for ltdl.h... no > > > checking for dlfcn.h.. . yes > > > checking for dlopen in -ldl... yes > > > checking for extra flags needed to export symbols... none > > > checking for linker flags for loadable modules... -G > > > checking for linker flags for loadable modules... -G > > > checking for traceroute... no > > > checking for snprintf... yes > > > checking for type va_list... yes > > > checking for perl... /usr/local/bin/perl > > > creating ./config.status > > > creating Makefile > > > creating subst > > > creating pkginfo > > > creating base/Makefile > > > creating common/Makefile > > > creating contrib/Makefile > > > creating cgi/Makefile > > > creating html/Makefile > > > creating module/Makefile > > > creating include/Makefile > > > creating xdata/Makefile > > > creating daemon-init > > > creating html/index.html > > > creating html/side.html > > > creating include/config.h > > > include/config.h is unchanged > > > creating include/snprintf.h > > > include/snprintf.h is unchanged > > > creating include/nagios.h > > > creating include/cgiutils.h > > > include/cgiutils.h is unchanged > > > > > > Creating sample config files in sample-config/ ... > > > > > > > > > *** Configuration summary for nagios 2.0b4 08-02-2005 ***: > > > > > > General Options: > > > ------------------------- > > > Nagios executable: nagios > > > Nagios user/group: nagios,nagios > > > Command user/group: nagios,nagios > > > Embedded Perl: no > > > Event Broker: yes > > > Install ${prefix}: /usr/local/nagios > > > Lock file: ${prefix}/var/nagios.lock > > > Init directory: /etc/init.d > > > Host OS: solaris2.6 > > > > > > Web Interface Options: > > > ------------------------ > > > HTML URL: http://localhost/nagios/ > > > CGI URL: http://localhost/nagios/cgi-bin/ > > > Traceroute (used by WAP): > > > > > > > > > Review the options above for accuracy. If they look okay, > > > type 'make all' to compile the main program and CGIs. > > > > > > > > > On 9/1/05, Subhendu Ghosh wrote: > > >> > > >> On Thu, 1 Sep 2005, Elizar M. Palad wrote: > > >> > > >>> Hi! Ok, i rerun configure script after installed perl, seems ok. no > > >> error > > >>> and all > > >>> the .h files are created and there's no more error in make > > >> install-config (i > > >>> had one before) > > >>> i am using the minimal.cfg to test nagios. > > >>> I started nagios without any options. > > >>> /usr/local/nagios/bin/nagios /usr/local/nagios/etc/nagios.cfg > > >>> and it says: > > >>> sh: @libexecdir@/check_ping: not found > > >>> sh: @libexecdir@/check_users: not found > > >>> sh: @libexecdir@/check_procs: not found > > >>> Is this ok? > > >> > > >> No - @libexecdir@ should have been fixed by configure. > > >> > > >> For nagios - there should be no errors in the configure process... > > >> > > >> > > >>> The IE page now has an error... > > >>> It appears as though you do not have permission to view information > > for > > >>> any of the hosts you requested... > > >>> > > >>> If you believe this is an error, check the HTTP server > > authentication > > >>> requirements for accessing this CGI > > >>> and check the authorization options in your CGI configuration file. > > >>> I didn't set up any authentication for nagios. Maybe it caused the > > error > > >>> above? > > >>> if it does, how to use nagios without authentication? > > >>> thanks! > > >>> > > >>> > > >>> On 8/31/05, Subhendu Ghosh wrote: > > >>>> > > >>>> > > >>>> Yes - autoconf which creates configure uses perl :) > > >>>> > > >>>> You can either compile your own or sunfreeware.com > > > > >> < http://sunfreeware.com>has some nice pkgadd > > >>>> versions. > > >>>> > > >>>> -sg > > >>>> > > >>>> > > >>>> On Wed, 31 Aug 2005, Elizar M. Palad wrote: > > >>>> > > >>>>> Hi, thanks! > > >>>>> from ./configure's output: > > >>>>> > > >>>>> creating html/index.html > > >>>>> creating html/side.html > > >>>>> creating include/config.h > > >>>>> creating include/snprintf.h > > >>>>> creating include/nagios.h > > >>>>> creating include/cgiutils.h > > >>>>> ./configure: perl: not found > > >>>>> ./configure: perl: not found > > >>>>> > > >>>>> Creating sample config files in sample-config/ ... > > >>>>> ./configure: perl: not found > > >>>>> ./configure: perl: not found > > >>>>> ./configure: perl: not found > > >>>>> ./configure: perl: not found > > >>>>> ./configure: perl: not found > > >>>>> ./configure: perl: not found > > >>>>> ./configure: perl: not found > > >>>>> ./configure: perl: not found > > >>>>> > > >>>>> > > >>>>> *** Configuration summary for nagios 2.0b4 08-02-2005 ***: > > >>>>> > > >>>>> General Options: > > >>>>> ------------------------- > > >>>>> Nagios executable: nagios > > >>>>> Nagios user/group: nagios,nagios > > >>>>> Command user/group: nagios,nagios > > >>>>> Embedded Perl: no > > >>>>> Event Broker: yes > > >>>>> Install ${prefix}: /usr/local/nagios > > >>>>> Lock file: ${prefix}/var/nagios.lock > > >>>>> Init directory: /etc/init.d > > >>>>> Host OS: solaris2.6 > > >>>>> .... > > >>>>> Im going to install now perl and see if it will create the > > >> location.hfile. > > >>>>> Thanks! > > >>>>> eli > > >>>>> > > >>>>> On 8/31/05, Subhendu Ghosh wrote: > > >>>>>> > > >>>>>> On Wed, 31 Aug 2005, Elizar M. Palad wrote: > > >>>>>> > > >>>>>>> I rerun ./configure --prefix=/usr/local/nagios > > >>>>>> --with-cgiurl=/nagios/cgi-bin > > >>>>>>> --with-htmlurl=/nagios --with-nagios-user=nagios > > >>>>>> --with-nagios-groupp=nagios > > >>>>>>> --with-gd-lib=/usr/local/lib > > >>>>>>> and summary is below: > > >>>>>>> *** Configuration summary for nagios 2.0b4 08-02-2005 ***: > > >>>>>>> > > >>>>>>> General Options: > > >>>>>>> ------------------------- > > >>>>>>> Nagios executable: nagios > > >>>>>>> Nagios user/group: nagios,nagios > > >>>>>>> Command user/group: nagios,nagios > > >>>>>>> Embedded Perl: no > > >>>>>>> Event Broker: yes > > >>>>>>> Install ${prefix}: /usr/local/nagios > > >>>>>>> Lock file: ${prefix}/var/nagios.lock > > >>>>>>> Init directory: /etc/init.d > > >>>>>>> Host OS: solaris2.6 > > >>>>>>> > > >>>>>>> Web Interface Options: > > >>>>>>> ------------------------ > > >>>>>>> HTML URL: http://localhost/nagios/ > > >>>>>>> CGI URL: http://localhost/nagios/cgi-bin/ > > >>>>>>> Traceroute (used by WAP): > > >>>>>>> > > >>>>>>> > > >>>>>>> Review the options above for accuracy. If they look okay, > > >>>>>>> type 'make all' to compile the main program and CGIs. > > >>>>>>> ------------------ > > >>>>>>> Looks ok isn't it? > > >>>>>>> then when i do 'make all', i have this error: > > >>>>>>> # make all > > >>>>>>> cd ./base && make > > >>>>>>> make: Fatal error: Don't know how to make target > > >>>>>> `../include/locations.h' > > >>>>>>> Current working directory /packages/nagios-2.0b4/base > > >>>>>>> *** Error code 1 > > >>>>>>> make: Fatal error: Command failed for target `all' > > >>>>>>> This is also my second time with this error. What I did was i > > copy > > >>>>>>> locations.h.in > > < > > >> http://locations.h.in> < > > >>>> http://locations.h.in> as > > >>>>>> locations.h :-) > > >>>>>>> and the compilation continues.. (was that not right? :) > > >>>>>>> thanks! > > >>>>>> > > >>>>>> > > >>>>>> NOOOOO - configure must have produced an error before the summary > > was > > >>>>>> printed. > > >>>>>> > > >>>>>> configure should create a locations.h from locations.h.in > > > > >> < http://locations.h.in> > > >>>> after > > >>>>>> appropriate substitutions. > > >>>>>> > > >>>>>> If the file is not being created properly - you may have a file > > >>>> permission > > >>>>>> issue. > > >>>>>> > > >>>>>> run "chown -R user:group nagios_src_dir/" > > >>>>>> > > >>>>>> replace the user and group with the user and group info fort he > > >> person > > >>>>>> running configure > > >>>>>> > > >>>>>> replace nagios_src_dir with the dir that contains the nagios src. > > > > >>>>>> > > >>>>>> Never copy a *.*.in file into a *.* file. > > >>>>>> > > >>>>>> > > >>>>>> > > >>>>>> -- > > >>>>>> > > >>>>>> -sg > > >>>>>> > > >>>>> > > >>>>> > > >>>>> > > >>>>> > > >>>> > > >>>> -- > > >>>> > > >>>> > > >>> > > >>> > > >>> > > >> > > >> -- > > >> > > >> > > >> > > >> ------------------------------------------------------- > > >> SF.Net email is Sponsored by the Better Software Conference & EXPO > > >> September 19-22, 2005 * San Francisco, CA * Development Lifecycle > > >> Practices > > >> Agile & Plan-Driven Development * Managing Projects & Teams * Testing > > & QA > > >> Security * Process Improvement & Measurement * > > http://www.sqe.com/bsce5sf > > >> _______________________________________________ > > >> Nagios-users mailing list > > >> Nagios-users at lists.sourceforge.net > > >> https://lists.sourceforge.net/lists/listinfo/nagios-users > > >> ::: Please include Nagios version, plugin version (-v) and OS when > > >> reporting any issue. > > >> ::: Messages without supporting info will risk being sent to > > /dev/null > > >> > > > > > > > > > > > > > > > > -- > > > > > > > -- > ---- > Don't Tell Me How Hard You Work.. > Show Me How Much You'd Accomplished.. > -- ---- Don't Tell Me How Hard You Work.. Show Me How Much You'd Accomplished.. -------------- next part -------------- An HTML attachment was scrubbed... URL: From todd_barbera at wgbh.org Fri Sep 2 02:55:27 2005 From: todd_barbera at wgbh.org (Todd Barbera) Date: Thu, 01 Sep 2005 20:55:27 -0400 Subject: @sysconfdir@/cgi.cfg - Installation setup problem References: Message-ID: <001b01c5af59$02c45aa0$0d7810ac@wgbh.org> Hi Eli, You can actually use crle to set your library path instead of having to set an environment variable. It seems to work pretty well on my Solaris systems. Ex: crle -l /usr/lib:/usr/ccs/lib:/usr/local/lib:/usr/local/ssl/lib Todd ----- Original Message ----- From: Elizar M. Palad To: Subhendu Ghosh Cc: nagios-users at lists.sourceforge.net Sent: Thursday, September 01, 2005 7:46 PM Subject: Re: [Nagios-users] @sysconfdir@/cgi.cfg - Installation setup problem Hi Subhendu! Found it! :-) http://www.nagios.org/faqs/viewfaq.php?faq_id=17 Will continue to learn nagios! Thanks for the help! and you guys will hear from me soon, i bet. ;) Thanks! eli On 9/2/05, Elizar M. Palad wrote: Hi, Actually, in /usr/local/nagios/etc there are now some *sample files. the resource.cfg that i do use has the @libexec@ and the resource.cfg-sample file has the one set in /usr/local/nagios/libexec, so used that one. When i rerun nagios, i got the Return code of 137 is out of bounds in the webpage and ld.so.1: /usr/local/nagios/libexec/check_ping: fatal: libiconv.so.2: open failed: No such file or directory ing the terminal.. although i have the libiconv.so.2 file in /usr/local/lib I read somewhere that i have to set some variable to the path of the library but i forgot where/what it is? LIB_... something.. :) Thanks! On 9/1/05, Subhendu Ghosh wrote: configure looks good here. In /usr/local/nagios/etc - do you only have one resource.cfg file? or do you have resource.cfg.in as well? you might want to delete every thing in /usr/local/nagios/etc and run "make install-config" again to get the base sample-config -sg On Thu, 1 Sep 2005, Elizar M. Palad wrote: > Hi Subhendu, > I rerun the ./configure... same options and i couldn't find any critical > errors. > I attached the output produced. The only thing that looks like an error is > the gd, which, according to the documentation, nagios will run without it. > > # ./configure --prefix=/usr/local/nagios --with-cgiurl=/nagios/cgi-bin > --with-htmlurl=/nagios --with-nagios-user=na > gios --with-nagios-groupp=nagios --with-gd-lib=/usr/local/lib > checking for a BSD compatible install... ./install-sh -c > checking host system type... sparc-sun-solaris2.6 > checking for gcc... gcc > checking whether the C compiler (gcc ) works... yes > checking whether the C compiler (gcc ) is a cross-compiler... no > checking whether we are using GNU C... yes > checking whether gcc accepts -g... yes > checking whether make sets ${MAKE}... yes > checking for strip... /usr/ccs/bin/strip > checking how to run the C preprocessor... gcc -E > checking for ANSI C header files... yes > checking whether time.h and sys/time.h may both be included... yes > checking for sys/wait.h that is POSIX.1 compatible... yes > checking for arpa/inet.h... yes > checking for ctype.h... yes > checking for dirent.h.. . yes > checking for errno.h... yes > checking for fcntl.h... yes > checking for getopt.h... no > checking for grp.h... yes > checking for limits.h... yes > checking for math.h... yes > checking for netdb.h... yes > checking for netinet/in.h... yes > checking for pthread.h... yes > checking for pthreads.h... no > checking for pwd.h... yes > checking for regex.h... yes > checking for signal.h... yes > checking for socket.h... no > checking for string.h... yes > checking for strings.h... yes > checking for sys/mman.h... yes > checking for sys/types.h... yes > checking for sys/time.h... yes > checking for sys/resource.h... yes > checking for sys/wait.h... (cached) yes > checking for sys/socket.h... yes > checking for sys/stat.h... yes > checking for sys/timeb.h... yes > checking for sys/un.h... yes > checking for sys/ipc.h... yes > checking for sys/msg.h... yes > checking for sys/poll.h... yes > checking for syslog.h... yes > checking for uio.h... no > checking for unistd.h... yes > checking for working const... yes > checking whether struct tm is in sys/time.h or time.h... time.h > checking for tm_zone in struct tm... no > checking for tzname... yes > checking for mode_t... yes > checking for pid_t... yes > checking for size_t... yes > checking return type of signal handlers... void > checking for uid_t in sys/types.h... yes > checking type of array argument to getgroups... gid_t > checking for initgroups... yes > checking for setenv... no > checking for strdup... yes > checking for strstr... yes > checking for strtoul... yes > checking for unsetenv... no > checking for type of socket size... size_t > checking for pthread_create in -lcma... no > checking for pthread_create in -lpthread... yes > checking for library containing nanosleep... -lposix4 > checking for mail... /usr/bin/mail > Init script directory: /etc/init.d > We'll use default routines (in xdata/xsddefault.*) for status data I/O... > We'll use default routines (in xdata/xcddefault.*) for comment data I/O... > We'll use default routines (in xdata/xrddefault.*) for retention data I/O... > We'll use template-based routines (in xdata/xodtemplate.*) for object data > I/O... > We'll use default routines (in xdata/xpddefault.*) for performance data > I/O... > We'll use default routines (in xdata/xdddefault.*) for scheduled downtime > data I/O... > checking for main in -liconv... yes > checking for gdImagePng in -lgd (order 1)... no > checking for gdImagePng in -lgd (order 2)... no > checking for gdImagePng in -lgd (order 3)... no > checking for gdImagePng in -lgd (order 4)... no > > > *** GD, PNG, and/or JPEG libraries could not be located... ********* > > Boutell's GD library is required to compile the statusmap, trends > and histogram CGIs. Get it from http://www.boutell.com/gd/, compile > it, and use the --with-gd-lib and --with-gd-inc arguments to specify > the locations of the GD library and include files. > > NOTE: In addition to the gd-devel library, you'll also need to make > sure you have the png-devel and jpeg-devel libraries installed > on your system. > > NOTE: After you install the necessary libraries on your system: > 1. Make sure /etc/ld.so.conf has an entry for the directory in > which the GD, PNG, and JPEG libraries are installed. > 2. Run 'ldconfig' to update the run-time linker options. > 3. Run 'make clean' in the Nagios distribution to clean out > any old references to your previous compile. > 4. Rerun the configure script. > > NOTE: If you can't get the configure script to recognize the GD libs > on your system, get over it and move on to other things. The > CGIs that use the GD libs are just a small part of the entire > Nagios package. Get everything else working first and then > revisit the problem. Make sure to check the nagios-users > mailing list archives for possible solutions to GD library > problems when you resume your troubleshooting. > > ******************************************************************** > > > checking for ltdl.h... no > checking for dlfcn.h.. . yes > checking for dlopen in -ldl... yes > checking for extra flags needed to export symbols... none > checking for linker flags for loadable modules... -G > checking for linker flags for loadable modules... -G > checking for traceroute... no > checking for snprintf... yes > checking for type va_list... yes > checking for perl... /usr/local/bin/perl > creating ./config.status > creating Makefile > creating subst > creating pkginfo > creating base/Makefile > creating common/Makefile > creating contrib/Makefile > creating cgi/Makefile > creating html/Makefile > creating module/Makefile > creating include/Makefile > creating xdata/Makefile > creating daemon-init > creating html/index.html > creating html/side.html > creating include/config.h > include/config.h is unchanged > creating include/snprintf.h > include/snprintf.h is unchanged > creating include/nagios.h > creating include/cgiutils.h > include/cgiutils.h is unchanged > > Creating sample config files in sample-config/ ... > > > *** Configuration summary for nagios 2.0b4 08-02-2005 ***: > > General Options: > ------------------------- > Nagios executable: nagios > Nagios user/group: nagios,nagios > Command user/group: nagios,nagios > Embedded Perl: no > Event Broker: yes > Install ${prefix}: /usr/local/nagios > Lock file: ${prefix}/var/nagios.lock > Init directory: /etc/init.d > Host OS: solaris2.6 > > Web Interface Options: > ------------------------ > HTML URL: http://localhost/nagios/ > CGI URL: http://localhost/nagios/cgi-bin/ > Traceroute (used by WAP): > > > Review the options above for accuracy. If they look okay, > type 'make all' to compile the main program and CGIs. > > > On 9/1/05, Subhendu Ghosh < sghosh at sghosh.org> wrote: >> >> On Thu, 1 Sep 2005, Elizar M. Palad wrote: >> >>> Hi! Ok, i rerun configure script after installed perl, seems ok. no >> error >>> and all >>> the .h files are created and there's no more error in make >> install-config (i >>> had one before) >>> i am using the minimal.cfg to test nagios. >>> I started nagios without any options. >>> /usr/local/nagios/bin/nagios /usr/local/nagios/etc/nagios.cfg >>> and it says: >>> sh: @libexecdir@/check_ping: not found >>> sh: @libexecdir@/check_users: not found >>> sh: @libexecdir@/check_procs: not found >>> Is this ok? >> >> No - @libexecdir@ should have been fixed by configure. >> >> For nagios - there should be no errors in the configure process... >> >> >>> The IE page now has an error... >>> It appears as though you do not have permission to view information for >>> any of the hosts you requested... >>> >>> If you believe this is an error, check the HTTP server authentication >>> requirements for accessing this CGI >>> and check the authorization options in your CGI configuration file. >>> I didn't set up any authentication for nagios. Maybe it caused the error >>> above? >>> if it does, how to use nagios without authentication? >>> thanks! >>> >>> >>> On 8/31/05, Subhendu Ghosh wrote: >>>> >>>> >>>> Yes - autoconf which creates configure uses perl :) >>>> >>>> You can either compile your own or sunfreeware.com >> < http://sunfreeware.com>has some nice pkgadd >>>> versions. >>>> >>>> -sg >>>> >>>> >>>> On Wed, 31 Aug 2005, Elizar M. Palad wrote: >>>> >>>>> Hi, thanks! >>>>> from ./configure's output: >>>>> >>>>> creating html/index.html >>>>> creating html/side.html >>>>> creating include/config.h >>>>> creating include/snprintf.h >>>>> creating include/nagios.h >>>>> creating include/cgiutils.h >>>>> ./configure: perl: not found >>>>> ./configure: perl: not found >>>>> >>>>> Creating sample config files in sample-config/ ... >>>>> ./configure: perl: not found >>>>> ./configure: perl: not found >>>>> ./configure: perl: not found >>>>> ./configure: perl: not found >>>>> ./configure: perl: not found >>>>> ./configure: perl: not found >>>>> ./configure: perl: not found >>>>> ./configure: perl: not found >>>>> >>>>> >>>>> *** Configuration summary for nagios 2.0b4 08-02-2005 ***: >>>>> >>>>> General Options: >>>>> ------------------------- >>>>> Nagios executable: nagios >>>>> Nagios user/group: nagios,nagios >>>>> Command user/group: nagios,nagios >>>>> Embedded Perl: no >>>>> Event Broker: yes >>>>> Install ${prefix}: /usr/local/nagios >>>>> Lock file: ${prefix}/var/nagios.lock >>>>> Init directory: /etc/init.d >>>>> Host OS: solaris2.6 >>>>> .... >>>>> Im going to install now perl and see if it will create the >> location.hfile. >>>>> Thanks! >>>>> eli >>>>> >>>>> On 8/31/05, Subhendu Ghosh wrote: >>>>>> >>>>>> On Wed, 31 Aug 2005, Elizar M. Palad wrote: >>>>>> >>>>>>> I rerun ./configure --prefix=/usr/local/nagios >>>>>> --with-cgiurl=/nagios/cgi-bin >>>>>>> --with-htmlurl=/nagios --with-nagios-user=nagios >>>>>> --with-nagios-groupp=nagios >>>>>>> --with-gd-lib=/usr/local/lib >>>>>>> and summary is below: >>>>>>> *** Configuration summary for nagios 2.0b4 08-02-2005 ***: >>>>>>> >>>>>>> General Options: >>>>>>> ------------------------- >>>>>>> Nagios executable: nagios >>>>>>> Nagios user/group: nagios,nagios >>>>>>> Command user/group: nagios,nagios >>>>>>> Embedded Perl: no >>>>>>> Event Broker: yes >>>>>>> Install ${prefix}: /usr/local/nagios >>>>>>> Lock file: ${prefix}/var/nagios.lock >>>>>>> Init directory: /etc/init.d >>>>>>> Host OS: solaris2.6 >>>>>>> >>>>>>> Web Interface Options: >>>>>>> ------------------------ >>>>>>> HTML URL: http://localhost/nagios/ >>>>>>> CGI URL: http://localhost/nagios/cgi-bin/ >>>>>>> Traceroute (used by WAP): >>>>>>> >>>>>>> >>>>>>> Review the options above for accuracy. If they look okay, >>>>>>> type 'make all' to compile the main program and CGIs. >>>>>>> ------------------ >>>>>>> Looks ok isn't it? >>>>>>> then when i do 'make all', i have this error: >>>>>>> # make all >>>>>>> cd ./base && make >>>>>>> make: Fatal error: Don't know how to make target >>>>>> `../include/locations.h' >>>>>>> Current working directory /packages/nagios-2.0b4/base >>>>>>> *** Error code 1 >>>>>>> make: Fatal error: Command failed for target `all' >>>>>>> This is also my second time with this error. What I did was i copy >>>>>>> locations.h.in < http://locations.h.in> < >> http://locations.h.in> < >>>> http://locations.h.in> as >>>>>> locations.h :-) >>>>>>> and the compilation continues.. (was that not right? :) >>>>>>> thanks! >>>>>> >>>>>> >>>>>> NOOOOO - configure must have produced an error before the summary was >>>>>> printed. >>>>>> >>>>>> configure should create a locations.h from locations.h.in< http://locations.h.in> >> < http://locations.h.in> >>>> < http://locations.h.in>after >>>>>> appropriate substitutions. >>>>>> >>>>>> If the file is not being created properly - you may have a file >>>> permission >>>>>> issue. >>>>>> >>>>>> run "chown -R user:group nagios_src_dir/" >>>>>> >>>>>> replace the user and group with the user and group info fort he >> person >>>>>> running configure >>>>>> >>>>>> replace nagios_src_dir with the dir that contains the nagios src. >>>>>> >>>>>> Never copy a *.*.in file into a *.* file. >>>>>> >>>>>> >>>>>> >>>>>> -- >>>>>> >>>>>> -sg >>>>>> >>>>> >>>>> >>>>> >>>>> >>>> >>>> -- >>>> >>>> >>> >>> >>> >> >> -- >> >> >> >> ------------------------------------------------------- >> SF.Net email is Sponsored by the Better Software Conference & EXPO >> September 19-22, 2005 * San Francisco, CA * Development Lifecycle >> Practices >> Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA >> Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf >> _______________________________________________ >> Nagios-users mailing list >> Nagios-users at lists.sourceforge.net >> https://lists.sourceforge.net/lists/listinfo/nagios-users >> ::: Please include Nagios version, plugin version (-v) and OS when >> reporting any issue. >> ::: Messages without supporting info will risk being sent to /dev/null >> > > > > -- -- ---- Don't Tell Me How Hard You Work.. Show Me How Much You'd Accomplished.. -- ---- Don't Tell Me How Hard You Work.. Show Me How Much You'd Accomplished.. -------------- next part -------------- An HTML attachment was scrubbed... URL: From elizar.palad at gmail.com Fri Sep 2 02:58:23 2005 From: elizar.palad at gmail.com (Elizar M. Palad) Date: Fri, 2 Sep 2005 08:58:23 +0800 Subject: @sysconfdir@/cgi.cfg - Installation setup problem In-Reply-To: <001b01c5af59$02c45aa0$0d7810ac@wgbh.org> References: <001b01c5af59$02c45aa0$0d7810ac@wgbh.org> Message-ID: Hi Todd, Thanks for the tip! Will try. Eli On 9/2/05, Todd Barbera wrote: > > Hi Eli, > You can actually use crle to set your library path instead of having to > set an environment variable. It seems to work pretty well on my Solaris > systems. Ex: > crle -l /usr/lib:/usr/ccs/lib:/usr/local/lib:/usr/local/ssl/lib > Todd > > ----- Original Message ----- > *From:* Elizar M. Palad > *To:* Subhendu Ghosh > *Cc:* nagios-users at lists.sourceforge.net > *Sent:* Thursday, September 01, 2005 7:46 PM > *Subject:* Re: [Nagios-users] @sysconfdir@/cgi.cfg - Installation setup > problem > > Hi Subhendu! > Found it! :-) > http://www.nagios.org/faqs/viewfaq.php?faq_id=17 > Will continue to learn nagios! > Thanks for the help! > and you guys will hear from me soon, i bet. ;) > Thanks! > eli > > On 9/2/05, Elizar M. Palad wrote: > > > > Hi, > > Actually, in /usr/local/nagios/etc there are now some *sample files. > > the resource.cfg that i do use has the @libexec@ and the > > resource.cfg-sample > > file has the one set in /usr/local/nagios/libexec, so used that one. > > When i rerun nagios, i got the > > Return code of 137 is out of bounds > > in the webpage and > > ld.so.1: /usr/local/nagios/libexec/check_ping: fatal: libiconv.so.2: > > open failed: No such file or directory > > ing the terminal.. > > although i have the libiconv.so.2 file in /usr/local/lib > > I read somewhere that i have to set some variable to the path > > of the library but i forgot where/what it is? LIB_... something.. :) > > Thanks! > > > > On 9/1/05, Subhendu Ghosh wrote: > > > > > > configure looks good here. > > > > > > In /usr/local/nagios/etc - do you only have one resource.cfg file? or > > > do > > > you have resource.cfg.in as well? > > > > > > you might want to delete every thing in /usr/local/nagios/etc and run > > > "make install-config" again to get the base sample-config > > > > > > -sg > > > > > > On Thu, 1 Sep 2005, Elizar M. Palad wrote: > > > > > > > Hi Subhendu, > > > > I rerun the ./configure... same options and i couldn't find any > > > critical > > > > errors. > > > > I attached the output produced. The only thing that looks like an > > > error is > > > > the gd, which, according to the documentation, nagios will run > > > without it. > > > > > > > > # ./configure --prefix=/usr/local/nagios > > > --with-cgiurl=/nagios/cgi-bin > > > > --with-htmlurl=/nagios --with-nagios-user=na > > > > gios --with-nagios-groupp=nagios --with-gd-lib=/usr/local/lib > > > > checking for a BSD compatible install... ./install-sh -c > > > > checking host system type... sparc-sun-solaris2.6 > > > > checking for gcc... gcc > > > > checking whether the C compiler (gcc ) works... yes > > > > checking whether the C compiler (gcc ) is a cross-compiler... no > > > > checking whether we are using GNU C... yes > > > > checking whether gcc accepts -g... yes > > > > checking whether make sets ${MAKE}... yes > > > > checking for strip... /usr/ccs/bin/strip > > > > checking how to run the C preprocessor... gcc -E > > > > checking for ANSI C header files... yes > > > > checking whether time.h and sys/time.h may both be included... yes > > > > checking for sys/wait.h that is POSIX.1 compatible... yes > > > > checking for arpa/inet.h... yes > > > > checking for ctype.h... yes > > > > checking for dirent.h.. . yes > > > > checking for errno.h... yes > > > > checking for fcntl.h... yes > > > > checking for getopt.h... no > > > > checking for grp.h... yes > > > > checking for limits.h... yes > > > > checking for math.h... yes > > > > checking for netdb.h... yes > > > > checking for netinet/in.h... yes > > > > checking for pthread.h... yes > > > > checking for pthreads.h... no > > > > checking for pwd.h... yes > > > > checking for regex.h... yes > > > > checking for signal.h... yes > > > > checking for socket.h... no > > > > checking for string.h... yes > > > > checking for strings.h... yes > > > > checking for sys/mman.h... yes > > > > checking for sys/types.h... yes > > > > checking for sys/time.h... yes > > > > checking for sys/resource.h... yes > > > > checking for sys/wait.h... (cached) yes > > > > checking for sys/socket.h... yes > > > > checking for sys/stat.h... yes > > > > checking for sys/timeb.h... yes > > > > checking for sys/un.h... yes > > > > checking for sys/ipc.h... yes > > > > checking for sys/msg.h... yes > > > > checking for sys/poll.h... yes > > > > checking for syslog.h... yes > > > > checking for uio.h... no > > > > checking for unistd.h... yes > > > > checking for working const... yes > > > > checking whether struct tm is in sys/time.h or time.h... time.h > > > > checking for tm_zone in struct tm... no > > > > checking for tzname... yes > > > > checking for mode_t... yes > > > > checking for pid_t... yes > > > > checking for size_t... yes > > > > checking return type of signal handlers... void > > > > checking for uid_t in sys/types.h... yes > > > > checking type of array argument to getgroups... gid_t > > > > checking for initgroups... yes > > > > checking for setenv... no > > > > checking for strdup... yes > > > > checking for strstr... yes > > > > checking for strtoul... yes > > > > checking for unsetenv... no > > > > checking for type of socket size... size_t > > > > checking for pthread_create in -lcma... no > > > > checking for pthread_create in -lpthread... yes > > > > checking for library containing nanosleep... -lposix4 > > > > checking for mail... /usr/bin/mail > > > > Init script directory: /etc/init.d > > > > We'll use default routines (in xdata/xsddefault.*) for status data > > > I/O... > > > > We'll use default routines (in xdata/xcddefault.*) for comment data > > > I/O... > > > > We'll use default routines (in xdata/xrddefault.*) for retention > > > data I/O... > > > > We'll use template-based routines (in xdata/xodtemplate.*) for > > > object data > > > > I/O... > > > > We'll use default routines (in xdata/xpddefault.*) for performance > > > data > > > > I/O... > > > > We'll use default routines (in xdata/xdddefault.*) for scheduled > > > downtime > > > > data I/O... > > > > checking for main in -liconv... yes > > > > checking for gdImagePng in -lgd (order 1)... no > > > > checking for gdImagePng in -lgd (order 2)... no > > > > checking for gdImagePng in -lgd (order 3)... no > > > > checking for gdImagePng in -lgd (order 4)... no > > > > > > > > > > > > *** GD, PNG, and/or JPEG libraries could not be located... ********* > > > > > > > > Boutell's GD library is required to compile the statusmap, trends > > > > and histogram CGIs. Get it from http://www.boutell.com/gd/, compile > > > > it, and use the --with-gd-lib and --with-gd-inc arguments to specify > > > > > > > the locations of the GD library and include files. > > > > > > > > NOTE: In addition to the gd-devel library, you'll also need to make > > > > sure you have the png-devel and jpeg-devel libraries installed > > > > on your system. > > > > > > > > NOTE: After you install the necessary libraries on your system: > > > > 1. Make sure /etc/ld.so.conf has an entry for the directory in > > > > which the GD, PNG, and JPEG libraries are installed. > > > > 2. Run 'ldconfig' to update the run-time linker options. > > > > 3. Run 'make clean' in the Nagios distribution to clean out > > > > any old references to your previous compile. > > > > 4. Rerun the configure script. > > > > > > > > NOTE: If you can't get the configure script to recognize the GD libs > > > > > > > on your system, get over it and move on to other things. The > > > > CGIs that use the GD libs are just a small part of the entire > > > > Nagios package. Get everything else working first and then > > > > revisit the problem. Make sure to check the nagios-users > > > > mailing list archives for possible solutions to GD library > > > > problems when you resume your troubleshooting. > > > > > > > > ******************************************************************** > > > > > > > > > > > > checking for ltdl.h... no > > > > checking for dlfcn.h.. . yes > > > > checking for dlopen in -ldl... yes > > > > checking for extra flags needed to export symbols... none > > > > checking for linker flags for loadable modules... -G > > > > checking for linker flags for loadable modules... -G > > > > checking for traceroute... no > > > > checking for snprintf... yes > > > > checking for type va_list... yes > > > > checking for perl... /usr/local/bin/perl > > > > creating ./config.status > > > > creating Makefile > > > > creating subst > > > > creating pkginfo > > > > creating base/Makefile > > > > creating common/Makefile > > > > creating contrib/Makefile > > > > creating cgi/Makefile > > > > creating html/Makefile > > > > creating module/Makefile > > > > creating include/Makefile > > > > creating xdata/Makefile > > > > creating daemon-init > > > > creating html/index.html > > > > creating html/side.html > > > > creating include/config.h > > > > include/config.h is unchanged > > > > creating include/snprintf.h > > > > include/snprintf.h is unchanged > > > > creating include/nagios.h > > > > creating include/cgiutils.h > > > > include/cgiutils.h is unchanged > > > > > > > > Creating sample config files in sample-config/ ... > > > > > > > > > > > > *** Configuration summary for nagios 2.0b4 08-02-2005 ***: > > > > > > > > General Options: > > > > ------------------------- > > > > Nagios executable: nagios > > > > Nagios user/group: nagios,nagios > > > > Command user/group: nagios,nagios > > > > Embedded Perl: no > > > > Event Broker: yes > > > > Install ${prefix}: /usr/local/nagios > > > > Lock file: ${prefix}/var/nagios.lock > > > > Init directory: /etc/init.d > > > > Host OS: solaris2.6 > > > > > > > > Web Interface Options: > > > > ------------------------ > > > > HTML URL: http://localhost/nagios/ > > > > CGI URL: http://localhost/nagios/cgi-bin/ > > > > Traceroute (used by WAP): > > > > > > > > > > > > Review the options above for accuracy. If they look okay, > > > > type 'make all' to compile the main program and CGIs. > > > > > > > > > > > > On 9/1/05, Subhendu Ghosh < sghosh at sghosh.org> wrote: > > > >> > > > >> On Thu, 1 Sep 2005, Elizar M. Palad wrote: > > > >> > > > >>> Hi! Ok, i rerun configure script after installed perl, seems ok. > > > no > > > >> error > > > >>> and all > > > >>> the .h files are created and there's no more error in make > > > >> install-config (i > > > >>> had one before) > > > >>> i am using the minimal.cfg to test nagios. > > > >>> I started nagios without any options. > > > >>> /usr/local/nagios/bin/nagios /usr/local/nagios/etc/nagios.cfg > > > >>> and it says: > > > >>> sh: @libexecdir@/check_ping: not found > > > >>> sh: @libexecdir@/check_users: not found > > > >>> sh: @libexecdir@/check_procs: not found > > > >>> Is this ok? > > > >> > > > >> No - @libexecdir@ should have been fixed by configure. > > > >> > > > >> For nagios - there should be no errors in the configure process... > > > >> > > > >> > > > >>> The IE page now has an error... > > > >>> It appears as though you do not have permission to view > > > information for > > > >>> any of the hosts you requested... > > > >>> > > > >>> If you believe this is an error, check the HTTP server > > > authentication > > > >>> requirements for accessing this CGI > > > >>> and check the authorization options in your CGI configuration > > > file. > > > >>> I didn't set up any authentication for nagios. Maybe it caused the > > > error > > > >>> above? > > > >>> if it does, how to use nagios without authentication? > > > >>> thanks! > > > >>> > > > >>> > > > >>> On 8/31/05, Subhendu Ghosh wrote: > > > >>>> > > > >>>> > > > >>>> Yes - autoconf which creates configure uses perl :) > > > >>>> > > > >>>> You can either compile your own or sunfreeware.com > > > > > > >> < http://sunfreeware.com>has some nice pkgadd > > > >>>> versions. > > > >>>> > > > >>>> -sg > > > >>>> > > > >>>> > > > >>>> On Wed, 31 Aug 2005, Elizar M. Palad wrote: > > > >>>> > > > >>>>> Hi, thanks! > > > >>>>> from ./configure's output: > > > >>>>> > > > >>>>> creating html/index.html > > > >>>>> creating html/side.html > > > >>>>> creating include/config.h > > > >>>>> creating include/snprintf.h > > > >>>>> creating include/nagios.h > > > >>>>> creating include/cgiutils.h > > > >>>>> ./configure: perl: not found > > > >>>>> ./configure: perl: not found > > > >>>>> > > > >>>>> Creating sample config files in sample-config/ ... > > > >>>>> ./configure: perl: not found > > > >>>>> ./configure: perl: not found > > > >>>>> ./configure: perl: not found > > > >>>>> ./configure: perl: not found > > > >>>>> ./configure: perl: not found > > > >>>>> ./configure: perl: not found > > > >>>>> ./configure: perl: not found > > > >>>>> ./configure: perl: not found > > > >>>>> > > > >>>>> > > > >>>>> *** Configuration summary for nagios 2.0b4 08-02-2005 ***: > > > >>>>> > > > >>>>> General Options: > > > >>>>> ------------------------- > > > >>>>> Nagios executable: nagios > > > >>>>> Nagios user/group: nagios,nagios > > > >>>>> Command user/group: nagios,nagios > > > >>>>> Embedded Perl: no > > > >>>>> Event Broker: yes > > > >>>>> Install ${prefix}: /usr/local/nagios > > > >>>>> Lock file: ${prefix}/var/nagios.lock > > > >>>>> Init directory: /etc/init.d > > > >>>>> Host OS: solaris2.6 > > > >>>>> .... > > > >>>>> Im going to install now perl and see if it will create the > > > >> location.hfile. > > > >>>>> Thanks! > > > >>>>> eli > > > >>>>> > > > >>>>> On 8/31/05, Subhendu Ghosh wrote: > > > >>>>>> > > > >>>>>> On Wed, 31 Aug 2005, Elizar M. Palad wrote: > > > >>>>>> > > > >>>>>>> I rerun ./configure --prefix=/usr/local/nagios > > > >>>>>> --with-cgiurl=/nagios/cgi-bin > > > >>>>>>> --with-htmlurl=/nagios --with-nagios-user=nagios > > > >>>>>> --with-nagios-groupp=nagios > > > >>>>>>> --with-gd-lib=/usr/local/lib > > > >>>>>>> and summary is below: > > > >>>>>>> *** Configuration summary for nagios 2.0b4 08-02-2005 ***: > > > >>>>>>> > > > >>>>>>> General Options: > > > >>>>>>> ------------------------- > > > >>>>>>> Nagios executable: nagios > > > >>>>>>> Nagios user/group: nagios,nagios > > > >>>>>>> Command user/group: nagios,nagios > > > >>>>>>> Embedded Perl: no > > > >>>>>>> Event Broker: yes > > > >>>>>>> Install ${prefix}: /usr/local/nagios > > > >>>>>>> Lock file: ${prefix}/var/nagios.lock > > > >>>>>>> Init directory: /etc/init.d > > > >>>>>>> Host OS: solaris2.6 > > > >>>>>>> > > > >>>>>>> Web Interface Options: > > > >>>>>>> ------------------------ > > > >>>>>>> HTML URL: http://localhost/nagios/ > > > >>>>>>> CGI URL: http://localhost/nagios/cgi-bin/ > > > >>>>>>> Traceroute (used by WAP): > > > >>>>>>> > > > >>>>>>> > > > >>>>>>> Review the options above for accuracy. If they look okay, > > > >>>>>>> type 'make all' to compile the main program and CGIs. > > > >>>>>>> ------------------ > > > >>>>>>> Looks ok isn't it? > > > >>>>>>> then when i do 'make all', i have this error: > > > >>>>>>> # make all > > > >>>>>>> cd ./base && make > > > >>>>>>> make: Fatal error: Don't know how to make target > > > >>>>>> `../include/locations.h' > > > >>>>>>> Current working directory /packages/nagios-2.0b4/base > > > >>>>>>> *** Error code 1 > > > >>>>>>> make: Fatal error: Command failed for target `all' > > > >>>>>>> This is also my second time with this error. What I did was i > > > copy > > > >>>>>>> locations.h.in > > > < http://locations.h.in> < > > > >> http://locations.h.in> < > > > >>>> http://locations.h.in> as > > > >>>>>> locations.h :-) > > > >>>>>>> and the compilation continues.. (was that not right? :) > > > >>>>>>> thanks! > > > >>>>>> > > > >>>>>> > > > >>>>>> NOOOOO - configure must have produced an error before the > > > summary was > > > >>>>>> printed. > > > >>>>>> > > > >>>>>> configure should create a locations.h from locations.h.in > > > < http://locations.h.in> > > > >> < http://locations.h.in> > > > >>>> < http://locations.h.in>after > > > >>>>>> appropriate substitutions. > > > >>>>>> > > > >>>>>> If the file is not being created properly - you may have a file > > > >>>> permission > > > >>>>>> issue. > > > >>>>>> > > > >>>>>> run "chown -R user:group nagios_src_dir/" > > > >>>>>> > > > >>>>>> replace the user and group with the user and group info fort he > > > > > > >> person > > > >>>>>> running configure > > > >>>>>> > > > >>>>>> replace nagios_src_dir with the dir that contains the nagios > > > src. > > > >>>>>> > > > >>>>>> Never copy a *.*.in file into a *.* file. > > > >>>>>> > > > >>>>>> > > > >>>>>> > > > >>>>>> -- > > > >>>>>> > > > >>>>>> -sg > > > >>>>>> > > > >>>>> > > > >>>>> > > > >>>>> > > > >>>>> > > > >>>> > > > >>>> -- > > > >>>> > > > >>>> > > > >>> > > > >>> > > > >>> > > > >> > > > >> -- > > > >> > > > >> > > > >> > > > >> ------------------------------------------------------- > > > >> SF.Net email is Sponsored by the Better Software Conference & EXPO > > > >> September 19-22, 2005 * San Francisco, CA * Development Lifecycle > > > >> Practices > > > >> Agile & Plan-Driven Development * Managing Projects & Teams * > > > Testing & QA > > > >> Security * Process Improvement & Measurement * > > > http://www.sqe.com/bsce5sf > > > >> _______________________________________________ > > > >> Nagios-users mailing list > > > >> Nagios-users at lists.sourceforge.net > > > >> https://lists.sourceforge.net/lists/listinfo/nagios-users > > > >> ::: Please include Nagios version, plugin version (-v) and OS when > > > >> reporting any issue. > > > >> ::: Messages without supporting info will risk being sent to > > > /dev/null > > > >> > > > > > > > > > > > > > > > > > > > > > > -- > > > > > > > > > > > > -- > > ---- > > Don't Tell Me How Hard You Work.. > > Show Me How Much You'd Accomplished.. > > > > > > -- > ---- > Don't Tell Me How Hard You Work.. > Show Me How Much You'd Accomplished.. > > -- ---- Don't Tell Me How Hard You Work.. Show Me How Much You'd Accomplished.. -------------- next part -------------- An HTML attachment was scrubbed... URL: From m.borsani at it.net Fri Sep 2 09:38:53 2005 From: m.borsani at it.net (Marco Borsani) Date: Fri, 2 Sep 2005 09:38:53 +0200 Subject: R: Ranges for check_snmp In-Reply-To: References: Message-ID: 1) Do you install any pathces regarding check_snmp ? 2) Reading the check_snmp help I see that "," (comma) is used to separate ranges for differents OIDs. In your example I see only one OID...Is it correct? My check_snmp version is "(nagios-plugins 1.3.1) 1.24.2.2" Regards Marco -}-----Messaggio originale----- -}Da: Justin Shore [mailto:justin.shore at sktbcs.com] -}Inviato: giovedi 1 settembre 2005 17.49 -}A: Marco Borsani; NAGIOS -}Oggetto: RE: [Nagios-users] Ranges for check_snmp -} -} -}Here is what I'm using to check temp on a Cisco 6509. The syntax -}should be applicable in your scenario. -} -}check_command -}check_snmp!1.3.6.1.4.1.9.9.91.1.1.1.1.4.4001!COMMUNITY-STRING!27,30!30,150 -} -}My check_snmp is defined as: -} -}# 'check_snmp' command definition -}define command{ -} command_name check_snmp -} command_line $USER1$/check_snmp -H $HOSTADDRESS$ -o -}$ARG1$ -C $ARG2$ -w $ARG3$ -c $ARG4$ -} } -} -}This translates into: -} -}check_snmp -H $HOSTADDRESS -o 1.3.6.1.4.1.9.9.91.1.1.1.1.4.4001 -}-C COMMUNITY-STRING -w 27,30 -c 30,150 -} -}Justin -} -} -}> -----Original Message----- -}> From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users- -}> admin at lists.sourceforge.net] On Behalf Of Marco Borsani -}> Sent: Thursday, September 01, 2005 3:42 AM -}> To: NAGIOS -}> Subject: [Nagios-users] Ranges for check_snmp -}> Importance: High -}> -}> Hi all ! -}> -}> I'd like to check some metrics on my firewall, like CPU but -}when I try to -}> set the ranges for warning and critical values I receive "strange" -}> answers. -}> -}> ./check_snmp -H HOSTADDRESS -o .1.3.6.1.4.1.9.9.109.1.1.1.1.3.1 -}-C public -}> SNMP WARNING - 9 -}> -}> ./check_snmp -H HOSTADDRESS -o .1.3.6.1.4.1.9.9.109.1.1.1.1.3.1 -}-C public -}> -w -}> 15:24 -c 25:100 -}> SNMP CRITICAL - *7* -}> -}> Why? What's my fault ? -}> -}> I need to receive a Warning over 15% and a Critical over 25%. -}> -}> Regards -}> -}> Marco Borsani -}> Unix & Monitoring System Administrator -}> Technical Operation -}> Tel. +39 010 4310115 -}> Fax +39 010 4327454 -}> E-mail: m.borsani at IT.net -}> -}> ITnet S.r.l. - Direzione e Coordinamento di WIND -}Telecomunicazioni S.p.A. -}> Internet Service Provider -}> Sede legale: Via C.G.Viola, 48 - 00148 Roma -}> Dir. Centrale e Amministrativa: Via Pacinotti, 39 -}> 16151 Genova (Italy) -}> -}> http://www.it.net -}> mailto:info at IT.net -}> _______________________________________________________________ -}> Altre sedi ITnet: -}> MILANO tel.: +39 02 30114900 info-milano at IT.net -}> ROMA tel.: +39 06 83116707 info-roma at IT.net -}> _______________________________________________________________ -}> ITnet is associated to CIX (Commercial IP eXchange) and RIPE -}> ITnet is associated to AIIP (Associazione Italiana Internet Providers) -}> -}> -}> -}> -}> ------------------------------------------------------- -}> SF.Net email is Sponsored by the Better Software Conference & EXPO -}> September 19-22, 2005 * San Francisco, CA * Development Lifecycle -}> Practices -}> Agile & Plan-Driven Development * Managing Projects & Teams * -}Testing & QA -}> Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > > -- > No virus found in this incoming message. > Checked by AVG Anti-Virus. > Version: 7.0.344 / Virus Database: 267.10.17/84 - Release Date: 8/29/2005 > -- No virus found in this outgoing message. Checked by AVG Anti-Virus. Version: 7.0.344 / Virus Database: 267.10.17/84 - Release Date: 8/29/2005 ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From RLAdams at Kelsey-Seybold.com Fri Sep 2 17:12:42 2005 From: RLAdams at Kelsey-Seybold.com (Adams, Russell L.) Date: Fri, 2 Sep 2005 10:12:42 -0500 Subject: fork errors In-Reply-To: <8ee0610105090111133989e4e1@mail.gmail.com> References: <8ee0610105090111133989e4e1@mail.gmail.com> Message-ID: <20050902151241.GA28870@pingu.ksnet.com> What distro? I had issues on Debian Woody with fork errors, and had to update some values which improved the situation but never actually fixed it. In sysctl.conf: kernel/threads-max = 100000 In /etc/security/limits: root soft nproc 1024 nagios soft nproc 1024 root hard nproc 2048 nagios hard nproc 2048 This kernel threads value helped, but there seems that there's another value hardcoded into the kernel that can only be changed via recompile. I can't remember what value that was, but it was set way too low in the default 2.4 kernel from Debian for Woody. I've been meaning to see what happens with Sarge. Russell On Thu, Sep 01, 2005 at 01:13:20PM -0500, Terry wrote: > Hello, > > I have been having this issue for quite some time. For some unknown > reason, nagios stops performing checks with these errors: > > [1125536952] Warning: The check of service 'PING' on host 'hostname' > could not be performed due to a fork() error. The check will be > rescheduled. > > All checks fail like this until nagios is restarted. When this > problem is occuring I can run the service checks manually both as the > nagios user and as the root user. There are no resource problems that > I can see at the time. We do not appear to be hitting a limit with > open files or anything like that either. The nagios mirrors the root > user in that area. > > What could be wrong? > > Thanks! > > > ------------------------------------------------------- > SF.Net email is Sponsored by the Better Software Conference & EXPO > September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices > Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA > Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Matthias.Eble at kaufland.de Fri Sep 2 17:04:41 2005 From: Matthias.Eble at kaufland.de (Matthias.Eble at kaufland.de) Date: Fri, 2 Sep 2005 17:04:41 +0200 Subject: Nagios - J2EE server [Virus checked] In-Reply-To: References: Message-ID: > Hi All, > > Is there a plug-in available to check the health of application server (Web > Logic or any App Server) instance? > Hi, we're currently monitoring just the http availability of our jboss servers. JBoss includes a small java Program called twiddle (in the bin directory) to read Values from JMX (eg Tomcat ActiveThreads, MaxThreads or JDBC Poolsize/AvailableConnections). It could be called/wrapped, but I guess calling a jvm for every check wouldn't be that clever. Is it less performance consuming to parse the html output of the jmx console? matthias ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From td3201 at gmail.com Fri Sep 2 16:41:14 2005 From: td3201 at gmail.com (Terry) Date: Fri, 2 Sep 2005 09:41:14 -0500 Subject: fork errors In-Reply-To: <20050901233155.96884.qmail@web31915.mail.mud.yahoo.com> References: <8ee0610105090111133989e4e1@mail.gmail.com> <20050901233155.96884.qmail@web31915.mail.mud.yahoo.com> Message-ID: <8ee0610105090207415fef2316@mail.gmail.com> I have a program that checks the logs by the minute and pages when the fork errors occur, so we are responding within minutes. I have looked at the resources every time it happens and we have plenty of resources. Is there a single plugin I can put into debugging mode so that when this happens I get more information as to why it is giving these errors? Here are a few facts: - the system is fine with memory all the time, never runs out (resident/paging) - there are not an unusual amount of processes running, maybe around 200 at a time, but no where near the ulimit setting - ulimit for the 'nagios' user matches that of root (unlimited). here is the ulimit: core file size (blocks, -c) 0 data seg size (kbytes, -d) unlimited file size (blocks, -f) unlimited max locked memory (kbytes, -l) 4 max memory size (kbytes, -m) unlimited open files (-n) 1024 pipe size (512 bytes, -p) 8 stack size (kbytes, -s) 10240 cpu time (seconds, -t) unlimited max user processes (-u) 7168 virtual memory (kbytes, -v) unlimited Thanks, Terry On 9/1/05, Fred wrote: > My guess would be to look at your resource utilization on your system, > most likely causes for fork() to fail are no more process slots, out of > memory, or past some kind of per-user (non-root) limit. When this > occurs look at your system logs, ps output and see if you have *lots* > of processes hanging around. It could be that nagios has stopped reaping > its children (or another unrelated process has sucked up the resources) > and you have simply pushed your system to the edge. It might be that you > get to that situation and it backs off before you even notice it and you > are left with nagios having problems dealing with the aftermath. > > -FredC > > --- Terry wrote: > > > Hello, > > > > I have been having this issue for quite some time. For some unknown > > reason, nagios stops performing checks with these errors: > > > > [1125536952] Warning: The check of service 'PING' on host 'hostname' > > could not be performed due to a fork() error. The check will be > > rescheduled. > > > > All checks fail like this until nagios is restarted. When this > > problem is occuring I can run the service checks manually both as the > > nagios user and as the root user. There are no resource problems that > > I can see at the time. We do not appear to be hitting a limit with > > open files or anything like that either. The nagios mirrors the root > > user in that area. > > > > What could be wrong? > > > > Thanks! > > > > > > ------------------------------------------------------- > > SF.Net email is Sponsored by the Better Software Conference & EXPO > > September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices > > Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA > > Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf > > _______________________________________________ > > Nagios-users mailing list > > Nagios-users at lists.sourceforge.net > > https://lists.sourceforge.net/lists/listinfo/nagios-users > > ::: Please include Nagios version, plugin version (-v) and OS when reporting > > any issue. > > ::: Messages without supporting info will risk being sent to /dev/null > > > > > > > > ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From td3201 at gmail.com Fri Sep 2 17:28:03 2005 From: td3201 at gmail.com (Terry) Date: Fri, 2 Sep 2005 10:28:03 -0500 Subject: fork errors In-Reply-To: <20050902151241.GA28870@pingu.ksnet.com> References: <8ee0610105090111133989e4e1@mail.gmail.com> <20050902151241.GA28870@pingu.ksnet.com> Message-ID: <8ee0610105090208282bbe40f1@mail.gmail.com> This is RHEL 3. On 9/2/05, Adams, Russell L. wrote: > What distro? > > I had issues on Debian Woody with fork errors, and had to update some > values which improved the situation but never actually fixed it. > > In sysctl.conf: > > kernel/threads-max = 100000 > > In /etc/security/limits: > > root soft nproc 1024 > nagios soft nproc 1024 > root hard nproc 2048 > nagios hard nproc 2048 > > This kernel threads value helped, but there seems that there's another > value hardcoded into the kernel that can only be changed via > recompile. I can't remember what value that was, but it was set way > too low in the default 2.4 kernel from Debian for Woody. > > I've been meaning to see what happens with Sarge. > > Russell > > > On Thu, Sep 01, 2005 at 01:13:20PM -0500, Terry wrote: > > Hello, > > > > I have been having this issue for quite some time. For some unknown > > reason, nagios stops performing checks with these errors: > > > > [1125536952] Warning: The check of service 'PING' on host 'hostname' > > could not be performed due to a fork() error. The check will be > > rescheduled. > > > > All checks fail like this until nagios is restarted. When this > > problem is occuring I can run the service checks manually both as the > > nagios user and as the root user. There are no resource problems that > > I can see at the time. We do not appear to be hitting a limit with > > open files or anything like that either. The nagios mirrors the root > > user in that area. > > > > What could be wrong? > > > > Thanks! > > > > > > ------------------------------------------------------- > > SF.Net email is Sponsored by the Better Software Conference & EXPO > > September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices > > Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA > > Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf > > _______________________________________________ > > Nagios-users mailing list > > Nagios-users at lists.sourceforge.net > > https://lists.sourceforge.net/lists/listinfo/nagios-users > > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > > ::: Messages without supporting info will risk being sent to /dev/null > > > ------------------------------------------------------- > SF.Net email is Sponsored by the Better Software Conference & EXPO > September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices > Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA > Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From robmossrm at aol.com Fri Sep 2 16:41:45 2005 From: robmossrm at aol.com (Rob Moss) Date: Fri, 02 Sep 2005 15:41:45 +0100 Subject: statusmap icons In-Reply-To: References: <99CF04974931C548B2BF20CD898FCBA7AF4643@uscpgmedexch01.medassets.com> Message-ID: <431864A9.1090806@aol.com> See below for example. Subhendu Ghosh wrote: > On Thu, 1 Sep 2005, Gonzales, Youn wrote: > >> The GD2 images work on the statusmap, but do not work on the popup, so >> it looks like I can either have the image on the map or the image on the >> popup, but not both. > > >> From docs: > > http://nagios.sourceforge.net/docs/2_0/xodtemplate.html#hostextinfo > > define hostextinfo{ > host_name host_name > notes note_string > notes_url url > action_url url > icon_image image_file > icon_image_alt alt_string > vrml_image image_file > statusmap_image image_file > 2d_coords x_coord,y_coord > 3d_coords x_coord,y_coord,z_coord > } > > Note: > icon_image/vrml_image - should be png/gif/jpg > statusmap_image - should be gd2 > Here is a working example /usr/local/nagios/etc/hostextinfo.cfg: define hostextinfo{ host_name freebsd.somewhere.com notes frizzee bizzee icon_image freebsd40.png icon_image_alt FreeBSD 5.4 vrml_image freebsd40.png statusmap_image freebsd40.gd2 2d_coords 1000,2500 3d_coords 100.0,50.0,75.0 } The logo icon pack is installed in /usr/local/nagios/share/images/logos rmoss at freebsd [/usr/local/nagios/share/images/logos]$ ls -la *freebsd* -rw-r--r-- 1 500 500 1455 Sep 23 2001 freebsd40.gd2 -rw-r--r-- 1 500 500 1491 Sep 23 2001 freebsd40.gif -rw-r--r-- 1 500 500 1697 Sep 23 2001 freebsd40.jpg -rw-r--r-- 1 500 500 1627 Sep 23 2001 freebsd40.png Cheers rob. >> >> -----Original Message----- >> From: Arno Lehmann [mailto:al at its-lehmann.de] >> Sent: Thursday, September 01, 2005 4:23 PM >> To: Gonzales, Youn >> Cc: NAGIOS >> Subject: Re: [Nagios-users] statusmap icons >> >> Hi, >> >> Gonzales, Youn wrote: >> >>> I am not able to view gd2 images in internet explorer. Is there a >> >> plugin >> >>> or something I need to install? >> >> >> No, you don't view them directly. They are used by libgd to create the >> statusmap image. >> >> You can simply convert one of your gifs to a gd2 format image, modify >> the configuration concerning hostextinfo to use the gd image, and see >> what happens. See the manual secion on extended information >> configuration for details, but note that my experience was that only gd >> images worked. I never tried to determine if that was a configuration >> error, an incorrect manual description, or anything else. >> >> With Nagios 1.2 and 2.0b3, by the way. >> >> Arno >> >>> -----Original Message----- >>> From: Arno Lehmann [mailto:al at its-lehmann.de] >>> Sent: Thursday, September 01, 2005 3:30 PM >>> To: Gonzales, Youn >>> Cc: NAGIOS >>> Subject: Re: [Nagios-users] statusmap icons >>> >>> Hi, >>> >>> Gonzales, Youn wrote: >>> >>> >>>> I am running 2.0b4 on Fedora 4. I can't seem to get the statusmap >>> >>> >>> icons >>> >>>> to work properly. >>>> >>>> define hostextinfo{ >>>> host_name uscpgls1010 >>>> icon_image network_switch.gif >>>> statusmap_image network_switch.gif >>>> } >>>> >>>> The icons show up in the status views - ie host detail - and when I >>>> float over the device on the statusmap the icons are in the top left >>>> corner of the popup. But, the icons on the status map are all the >>>> unknown.gif icons. >>>> >>>> I can browse all of the icons, so it is not an apache or permissions >>>> issue. Any suggestions? >>> >>> >>> >>> Yes, try using gd2 images. his worked here, I don't know what the >> >> manual >> >>> >>> has to say... >>> >>> Arno >>> > -- Rob Moss Unix Systems Admin Hosting & DB Operations Hammersmith, London, UK ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From naguser at bhodisoft.com Fri Sep 2 19:24:06 2005 From: naguser at bhodisoft.com (naguser at bhodisoft.com) Date: Fri, 2 Sep 2005 10:24:06 -0700 (PDT) Subject: question about service dependencies on the same host Message-ID: <59635.209.74.96.17.1125681846.squirrel@www.robinsoncomputerservice.com> I'm running a number of checks that use the netsaint client extension. When that client fails, then we get notifications on all of the service checks that client provides. I'd like to set up a dependency where those checks/notifications won't be done if the client is down. The main issue I'm running into is that the servicedependancy definition doesn't appear to support macros or multiple hosts, which means I'm going to have to manually create them all individually. Is there an easier way to do this that I'm missing? Thanks. -G_E ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From robmossrm at aol.com Fri Sep 2 17:46:07 2005 From: robmossrm at aol.com (Rob Moss) Date: Fri, 02 Sep 2005 16:46:07 +0100 Subject: Nagios - J2EE server In-Reply-To: References: Message-ID: <431873BF.1060101@aol.com> Hi, I haven't checked the contributed plugins, but I'm looking at the same kind of checks. My current plan is to use the check_http plugin (or my own custom built SSL enabled one) and have a .JSP page on the webserver which executes some checks on the weblogic/tomcat server and returns HTTP 200 success if all checks are working, and HTTP 500 error (with the error message) if it fails. IF apache/webserver isn't running, there will be an error IF tomcat/weblogic isn't running, there will be an error IF the JSP page fails any of the checks, there will be an error ELSE there is a success message. It would be nice for someone to write (if it isn't already written) a plugin which checks the Weblogic clustering, and notifies on state changes. I have limited knoweledge on Weblogic/Tomcat (i refuse to touch java. long live perl!) so I'm not much good there.. :-) Cheers rob. Praveen Muthyala Manohar wrote: >Hi All, > >Is there a plug-in available to check the health of application server (Web >Logic or any App Server) instance? > >Regards >Praveen M M > > >------------------------------------------------------- >SF.Net email is Sponsored by the Better Software Conference & EXPO >September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices >Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA >Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf >_______________________________________________ >Nagios-users mailing list >Nagios-users at lists.sourceforge.net >https://lists.sourceforge.net/lists/listinfo/nagios-users >::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. >::: Messages without supporting info will risk being sent to /dev/null > > > -- Rob Moss Unix Systems Admin Hosting & DB Operations Hammersmith, London, UK Phone: +44 20 7348 8629 ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From justin.shore at sktbcs.com Fri Sep 2 21:47:17 2005 From: justin.shore at sktbcs.com (Justin Shore) Date: Fri, 2 Sep 2005 14:47:17 -0500 Subject: Ranges for check_snmp Message-ID: No patches. The box is a Gentoo 2005.0 box with NET-SNMP 5.2.1 and a hand-compiled Nagios 2.0b4 install. My check_snmp is 1.57 from the 1.4.1 plugins. I can't speak to the OID and comma syntax since I don't have a need to check more than one OID in the same check command right now. Your initial question shouldn't require more than one OID either. Give that a try and see how it goes. Justin > -----Original Message----- > From: Marco Borsani [mailto:m.borsani at it.net] > Sent: Friday, September 02, 2005 2:39 AM > To: Justin Shore > Cc: NAGIOS > Subject: R: [Nagios-users] Ranges for check_snmp > > > 1) Do you install any pathces regarding check_snmp ? > > 2) Reading the check_snmp help I see that "," (comma) is used to separate > ranges for differents OIDs. In your example I see only one OID...Is it > correct? > > My check_snmp version is "(nagios-plugins 1.3.1) 1.24.2.2" > > Regards > > Marco > > -}-----Messaggio originale----- > -}Da: Justin Shore [mailto:justin.shore at sktbcs.com] > -}Inviato: giovedi 1 settembre 2005 17.49 > -}A: Marco Borsani; NAGIOS > -}Oggetto: RE: [Nagios-users] Ranges for check_snmp > -} > -} > -}Here is what I'm using to check temp on a Cisco 6509. The syntax > -}should be applicable in your scenario. > -} > -}check_command > -}check_snmp!1.3.6.1.4.1.9.9.91.1.1.1.1.4.4001!COMMUNITY- > STRING!27,30!30,150 > -} > -}My check_snmp is defined as: > -} > -}# 'check_snmp' command definition > -}define command{ > -} command_name check_snmp > -} command_line $USER1$/check_snmp -H $HOSTADDRESS$ -o > -}$ARG1$ -C $ARG2$ -w $ARG3$ -c $ARG4$ > -} } > -} > -}This translates into: > -} > -}check_snmp -H $HOSTADDRESS -o 1.3.6.1.4.1.9.9.91.1.1.1.1.4.4001 > -}-C COMMUNITY-STRING -w 27,30 -c 30,150 > -} > -}Justin > -} > -} > -}> -----Original Message----- > -}> From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users- > -}> admin at lists.sourceforge.net] On Behalf Of Marco Borsani > -}> Sent: Thursday, September 01, 2005 3:42 AM > -}> To: NAGIOS > -}> Subject: [Nagios-users] Ranges for check_snmp > -}> Importance: High > -}> > -}> Hi all ! > -}> > -}> I'd like to check some metrics on my firewall, like CPU but > -}when I try to > -}> set the ranges for warning and critical values I receive "strange" > -}> answers. > -}> > -}> ./check_snmp -H HOSTADDRESS -o .1.3.6.1.4.1.9.9.109.1.1.1.1.3.1 > -}-C public > -}> SNMP WARNING - 9 > -}> > -}> ./check_snmp -H HOSTADDRESS -o .1.3.6.1.4.1.9.9.109.1.1.1.1.3.1 > -}-C public > -}> -w > -}> 15:24 -c 25:100 > -}> SNMP CRITICAL - *7* > -}> > -}> Why? What's my fault ? > -}> > -}> I need to receive a Warning over 15% and a Critical over 25%. > -}> > -}> Regards > -}> > -}> Marco Borsani > -}> Unix & Monitoring System Administrator > -}> Technical Operation > -}> Tel. +39 010 4310115 > -}> Fax +39 010 4327454 > -}> E-mail: m.borsani at IT.net > -}> > -}> ITnet S.r.l. - Direzione e Coordinamento di WIND > -}Telecomunicazioni S.p.A. > -}> Internet Service Provider > -}> Sede legale: Via C.G.Viola, 48 - 00148 Roma > -}> Dir. Centrale e Amministrativa: Via Pacinotti, 39 > -}> 16151 Genova (Italy) > -}> > -}> http://www.it.net > -}> mailto:info at IT.net > -}> _______________________________________________________________ > -}> Altre sedi ITnet: > -}> MILANO tel.: +39 02 30114900 info-milano at IT.net > -}> ROMA tel.: +39 06 83116707 info-roma at IT.net > -}> _______________________________________________________________ > -}> ITnet is associated to CIX (Commercial IP eXchange) and RIPE > -}> ITnet is associated to AIIP (Associazione Italiana Internet Providers) > -}> > -}> > -}> > -}> > -}> ------------------------------------------------------- > -}> SF.Net email is Sponsored by the Better Software Conference & EXPO > -}> September 19-22, 2005 * San Francisco, CA * Development Lifecycle > -}> Practices > -}> Agile & Plan-Driven Development * Managing Projects & Teams * > -}Testing & QA > -}> Security * Process Improvement & Measurement * > http://www.sqe.com/bsce5sf > > _______________________________________________ > > Nagios-users mailing list > > Nagios-users at lists.sourceforge.net > > https://lists.sourceforge.net/lists/listinfo/nagios-users > > ::: Please include Nagios version, plugin version (-v) and OS when > > reporting any issue. > > ::: Messages without supporting info will risk being sent to /dev/null > > > > -- > > No virus found in this incoming message. > > Checked by AVG Anti-Virus. > > Version: 7.0.344 / Virus Database: 267.10.17/84 - Release Date: > 8/29/2005 > > > > -- > No virus found in this outgoing message. > Checked by AVG Anti-Virus. > Version: 7.0.344 / Virus Database: 267.10.17/84 - Release Date: 8/29/2005 > > > > -- > No virus found in this incoming message. > Checked by AVG Anti-Virus. > Version: 7.0.344 / Virus Database: 267.10.18/87 - Release Date: 9/1/2005 > -- No virus found in this outgoing message. Checked by AVG Anti-Virus. Version: 7.0.344 / Virus Database: 267.10.18/87 - Release Date: 9/1/2005 ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From drussell at mpifix.com Fri Sep 2 22:02:09 2005 From: drussell at mpifix.com (Danny Russell) Date: Fri, 2 Sep 2005 14:02:09 -0600 Subject: Downtime on hosts & services Message-ID: <028B223BB24E5443A9784BBE1CC5EBE054B530@Lisa.mpifix.com> I have many hosts setup with many services on each. Nagios is working and monitoring. I am running v1.2 I schedule downtime on a host, but not its services. I would think that when I schedule downtime on the host it would not bother me about the services on the host. I am still getting notifications about the services on that host. Why is this? Is this by design or is it an option to change somewhere? Thanks! Danny -------------- next part -------------- An HTML attachment was scrubbed... URL: From thasyndicate at yahoo.com Fri Sep 2 21:58:33 2005 From: thasyndicate at yahoo.com (tha syndicate) Date: Fri, 2 Sep 2005 12:58:33 -0700 (PDT) Subject: Nagios Installation problem Message-ID: <20050902195833.12331.qmail@web54004.mail.yahoo.com> Having a problem installing Nagios 1.2 on Fedora Core 4. I'm following the documention step by step in the documentation on Nagios site. When I get to the step to run /usr/local/nagios/bin/nagios -v /usr/local/nagios/etc/nagios.cfg to check my config file. I get the following feedback: Nagios 1.2 Copyright (c) 1999-2004 Ethan Galstad (nagios at nagios.org) Last Modified: 02-02-2004 License: GPL Reading configuration data... Error: Unexpected token or statement in file '/usr/local/nagios/etc/nagios.cfg' on line 19. ***> One or more problems was encountered while processing the config files... Check your configuration file(s) to ensure that they contain valid directives and data defintions. If you are upgrading from a previous version of Nagios, you should be aware that some variables/definitions may have been removed or modified in this version. Make sure to read the HTML documentation on the main and host config files, as well as the 'Whats New' section to find out what has changed. Line #19 in my config file is: log_file=/usr/local/nagios/var/nagios.log The log file is in the correct place so it does exist, I don't know what else to check for. Then, if I comment out that line in the config file, it just stops at the next line\file that's not commented out (cfg_file=/usr/local/nagios/etc/checkcommands.cfg) and gives me the same message, then if I comment that line out, its stops at the next line\file that's not commented and gives me the same thing.....so it looks as though its going to see some sort of issue with every single uncommented line in the config file for some reason. Anyone have any ideas on what could be wrong??? Also..... persmissions on the log file are: -rw-r--r-- 1 nagios nagios 876 Aug 29 15:26 nagios.log and permissions from the next file I stated in the original post is: -rwxrwxrwx 1 root root 4475 Aug 30 14:58 checkcommands.cfg Haven't checked the rest of the uncommented files permission since I'm not sure it's even an issue. After I run the command no additional info is posted in /var/log/messages Thanks, Jimmy __________________________________________________ Do You Yahoo!? Tired of spam? Yahoo! Mail has the best spam protection around http://mail.yahoo.com ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From f1216 at yahoo.com Fri Sep 2 22:16:29 2005 From: f1216 at yahoo.com (Fred) Date: Fri, 2 Sep 2005 13:16:29 -0700 (PDT) Subject: fork errors In-Reply-To: <8ee0610105090207415fef2316@mail.gmail.com> References: <8ee0610105090207415fef2316@mail.gmail.com> Message-ID: <20050902201629.7889.qmail@web31906.mail.mud.yahoo.com> Just for fun, you might try creating the problem and see how many forks you *can* get, for example: #!/usr/bin/perl my $c=0; do { my $pid = fork(); if ($pid) { $c++; print "\rchildcount $c "; } else { sleep(1); exit(0); } } while 1; to create as many procs as you can and test your limit. You would want to do this under the same environment as the nagios process runs. They will all be kept defunct until the process exits (when you hit the max processes you can create) The other thing you might try is to start nagios under strace -f and output the data to a log. You can specify just forks for strace, i.e., strace -f -e trace=process >/tmp/,log 2>&1 nagios .... That would give you a good handle on what is going on when the failure occurs. Might slow nagios down a bit, but probably nothing significant. -FredC Terry wrote: I have a program that checks the logs by the minute and pages when the fork errors occur, so we are responding within minutes. I have looked at the resources every time it happens and we have plenty of resources. Is there a single plugin I can put into debugging mode so that when this happens I get more information as to why it is giving these errors? Here are a few facts: - the system is fine with memory all the time, never runs out (resident/paging) - there are not an unusual amount of processes running, maybe around 200 at a time, but no where near the ulimit setting - ulimit for the 'nagios' user matches that of root (unlimited). here is the ulimit: core file size (blocks, -c) 0 data seg size (kbytes, -d) unlimited file size (blocks, -f) unlimited max locked memory (kbytes, -l) 4 max memory size (kbytes, -m) unlimited open files (-n) 1024 pipe size (512 bytes, -p) 8 stack size (kbytes, -s) 10240 cpu time (seconds, -t) unlimited max user processes (-u) 7168 virtual memory (kbytes, -v) unlimited Thanks, Terry On 9/1/05, Fred wrote: > My guess would be to look at your resource utilization on your system, > most likely causes for fork() to fail are no more process slots, out of > memory, or past some kind of per-user (non-root) limit. When this > occurs look at your system logs, ps output and see if you have *lots* > of processes hanging around. It could be that nagios has stopped reaping > its children (or another unrelated process has sucked up the resources) > and you have simply pushed your system to the edge. It might be that you > get to that situation and it backs off before you even notice it and you > are left with nagios having problems dealing with the aftermath. > > -FredC > > --- Terry wrote: > > > Hello, > > > > I have been having this issue for quite some time. For some unknown > > reason, nagios stops performing checks with these errors: > > > > [1125536952] Warning: The check of service 'PING' on host 'hostname' > > could not be performed due to a fork() error. The check will be > > rescheduled. > > > > All checks fail like this until nagios is restarted. When this > > problem is occuring I can run the service checks manually both as the > > nagios user and as the root user. There are no resource problems that > > I can see at the time. We do not appear to be hitting a limit with > > open files or anything like that either. The nagios mirrors the root > > user in that area. > > > > What could be wrong? > > > > Thanks! > > > > > > ------------------------------------------------------- > > SF.Net email is Sponsored by the Better Software Conference & EXPO > > September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices > > Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA > > Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf > > _______________________________________________ > > Nagios-users mailing list > > Nagios-users at lists.sourceforge.net > > https://lists.sourceforge.net/lists/listinfo/nagios-users > > ::: Please include Nagios version, plugin version (-v) and OS when reporting > > any issue. > > ::: Messages without supporting info will risk being sent to /dev/null > > > > > > > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From Mark.Law at thomson.com Fri Sep 2 22:28:14 2005 From: Mark.Law at thomson.com (Mark.Law at thomson.com) Date: Fri, 2 Sep 2005 16:28:14 -0400 Subject: check_nt configuration Message-ID: I have a different problem with check_nt. >From the command line: /usr/lib/nagios/plugins/check_nt -H servername -p 1248 -v SERVICESTATE -d SHOWALL -l "HUMMINGBIRD STR SERVICE" works great and returns: HUMMINGBIRD STR SERVICE: Started just like it is supposed to do. However, when run from nagios it says: HUMMINGBIRD STR SERVICE: Unknown The check_nt command definition has been modified like this: command_line /usr/lib/nagios/plugins/check_nt -H $HOSTADDRESS$ -v SERVICESTATE -d SHOWALL -l $ARG1$ I removed the quotes from around $ARG1$ so that multiple services could be checked by passing with embedded quotes, like "server","service with space in the name", etc. This works great on one server running nagios but not the other. I have check and re-checked paths, permissions, check_nt versions and am stumped. Any help out there? Thanks, Mark ________________________________ From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users-admin at lists.sourceforge.net] On Behalf Of Jon Sent: Thursday, August 25, 2005 2:46 PM To: Nagios-Users Subject: RE: [Nagios-users] check_nt configuration Hi, Since you mentioned that you have installed the plugins in a different location, did you make change to the $USER1$ variable to point it to the correct location? To set the variable $USER1$ to /usr/lib/nagios/plugins/ you need to edit the resource.cfg. You can also try executing the command to see if it actually works. Just go to where the check_nt is located and run it. That would eliminate a few things. HTH, Jon -------------- next part -------------- An HTML attachment was scrubbed... URL: From blakekrone at gmail.com Fri Sep 2 23:24:28 2005 From: blakekrone at gmail.com (Blake Krone) Date: Fri, 2 Sep 2005 15:24:28 -0600 Subject: Setting env variables from misc commands (email from field) Message-ID: Hello all, I'm trying to set it so that when using nullmailer the From field will be "alerts" instead of root as it is currently. I tried to set it by doing: export USER=alerts;/usr/bin/printf "%s" ...... etc but nagios won't send out alerts when I have it set like that. How can I set the from field?? Thanks! Blake -------------- next part -------------- An HTML attachment was scrubbed... URL: From ladams at cloudmark.com Fri Sep 2 23:46:11 2005 From: ladams at cloudmark.com (Lori Adams) Date: Fri, 2 Sep 2005 14:46:11 -0700 Subject: Downtime on hosts & services Message-ID: This was in the docs: As a side note, notifications for services are suppressed if the host they're associated with is in a period of scheduled downtime. Are you sure he had it in a downtime, there was a "thought bubble" and the "z,z,z" picture next to your host? -Lori ________________________________ From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users-admin at lists.sourceforge.net] On Behalf Of Danny Russell Sent: Friday, September 02, 2005 1:02 PM To: nagios-users at lists.sourceforge.net Subject: [Nagios-users] Downtime on hosts & services I have many hosts setup with many services on each. Nagios is working and monitoring. I am running v1.2 I schedule downtime on a host, but not its services. I would think that when I schedule downtime on the host it would not bother me about the services on the host. I am still getting notifications about the services on that host. Why is this? Is this by design or is it an option to change somewhere? Thanks! Danny -------------- next part -------------- An HTML attachment was scrubbed... URL: From sghosh at sghosh.org Sat Sep 3 05:10:59 2005 From: sghosh at sghosh.org (Subhendu Ghosh) Date: Fri, 2 Sep 2005 23:10:59 -0400 (EDT) Subject: Setting env variables from misc commands (email from field) In-Reply-To: References: Message-ID: On Fri, 2 Sep 2005, Blake Krone wrote: > Hello all, I'm trying to set it so that when using nullmailer the From field > will be "alerts" instead of root as it is currently. I tried to set it by > doing: > export USER=alerts;/usr/bin/printf "%s" ...... etc > but nagios won't send out alerts when I have it set like that. > > How can I set the from field?? > > Thanks! > Blake > sendmail genericstable http://www.linux.com/howtos/Sendmail-Address-Rewrite.shtml -- ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From amontibello at gmail.com Sat Sep 3 10:42:21 2005 From: amontibello at gmail.com (Anthony Montibello) Date: Sat, 3 Sep 2005 04:42:21 -0400 Subject: check_nt configuration In-Reply-To: References: Message-ID: I know the following works for NC_Net but I am not sure if it works for ns_client but I assume it should. add the " on the command definition again, command_line /usr/lib/nagios/plugins/check_nt -H $HOSTADDRESS$ -v SERVICESTATE -d SHOWALL -l "$ARG1$" because the check_command is delimited by the ! and can have spaces in it without the " thus: check_command check_nt!HUMMINGBIRD STR SERVICE I use the following in my config check_command check_nc_net_with_l_d!SERVICESTATE!MSSQL\$BKUPEXEC,Backup Exec Job Engine,Backup Exec Server!SHOWALL define command{ command_name check_nc_net_with_l_d command_line $USER1$/check_nt -H $HOSTADDRESS$ -p 1248 -v $ARG1$ -l "$ARG2$" -d $ARG3$ } I hope this helps Tony amontibello at shatterit.com On 9/2/05, Mark.Law at thomson.com wrote: > > I have a different problem with check_nt. > > From the command line: > > /usr/lib/nagios/plugins/check_nt -H servername -p 1248 -v SERVICESTATE -d > SHOWALL -l "HUMMINGBIRD STR SERVICE" > > works great and returns: > > HUMMINGBIRD STR SERVICE: Started > > just like it is supposed to do. However, when run from nagios it says: > > HUMMINGBIRD STR SERVICE: Unknown > > The check_nt command definition has been modified like this: > > command_line /usr/lib/nagios/plugins/check_nt -H $HOSTADDRESS$ -v > SERVICESTATE -d SHOWALL -l $ARG1$ > > I removed the quotes from around $ARG1$ so that multiple services could > be checked by passing with embedded quotes, like "server","service with > space in the name", etc. > > This works great on one server running nagios but not the other. I have > check and re-checked paths, permissions, check_nt versions and am stumped. > > Any help out there? > > Thanks, > > Mark > > ------------------------------ > > *From:* nagios-users-admin at lists.sourceforge.net [mailto: > nagios-users-admin at lists.sourceforge.net] *On Behalf Of *Jon > *Sent:* Thursday, August 25, 2005 2:46 PM > *To:* Nagios-Users > *Subject:* RE: [Nagios-users] check_nt configuration > > Hi, > > Since you mentioned that you have installed the plugins in a different > location, did you make change to the $USER1$ variable to point it to the > correct location? To set the variable $USER1$ to /usr/lib/nagios/plugins/ > you need to edit the resource.cfg. > > You can also try executing the command to see if it actually works. Just > go to where the check_nt is located and run it. That would eliminate a few > things. > > HTH, > > Jon > -------------- next part -------------- An HTML attachment was scrubbed... URL: From jhmartin at toger.us Sat Sep 3 16:58:21 2005 From: jhmartin at toger.us (Jason Martin) Date: Sat, 3 Sep 2005 07:58:21 -0700 Subject: nagios pooling question In-Reply-To: References: Message-ID: <20050903145821.GD810@zippy.toger.us> On Thu, Sep 01, 2005 at 02:14:20PM -0400, William Wang wrote: > Can someone tell me how Nagios does pooling when it connect to the NT or UNIX agents? Are TCP or UDP ports used? What's the port numbers? Nagios itself doesn't know how to check anything, it is up to the plugins to do that. Many of the UNIX plugins are not network-aware, instead relying on the NRPE system (which uses a tcp-based daemon) for network transport. Windows works in a similiar fashion. -Jason Martin -- Buy Land Now. It's Not Being Made Any More. This message is PGP/MIME signed. -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: application/pgp-signature Size: 211 bytes Desc: not available URL: From lists at webtent.net Mon Sep 5 05:13:29 2005 From: lists at webtent.net (Robert Fitzpatrick) Date: Sun, 04 Sep 2005 23:13:29 -0400 Subject: new install won't start Message-ID: <1125890009.10655.5.camel@columbus.webtent.org> I am lost. I am setup Nagios 1.2 many times. Now I have setup on my SuSE 9.3 box with rpm packages. All the files are there, I believe the permissions are fine, the configuration file checks out OK with '-v' and the log file just gives me the following. The lock file even has the PID, but no such PID running after starting. Any ideas... [1125887280] Nagios 1.2 starting... (PID=8639) [1125887280] Finished daemonizing... (New PID=8640) -- Robert ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From al at its-lehmann.de Mon Sep 5 09:05:33 2005 From: al at its-lehmann.de (Arno Lehmann) Date: Mon, 05 Sep 2005 09:05:33 +0200 Subject: new install won't start In-Reply-To: <1125890009.10655.5.camel@columbus.webtent.org> References: <1125890009.10655.5.camel@columbus.webtent.org> Message-ID: <431BEE3D.2020608@its-lehmann.de> Hello, Robert Fitzpatrick wrote: > I am lost. I am setup Nagios 1.2 many times. Now I have setup on my SuSE > 9.3 box with rpm packages. All the files are there, I believe the > permissions are fine, the configuration file checks out OK with '-v' and > the log file just gives me the following. The lock file even has the > PID, but no such PID running after starting. Any ideas... > > [1125887280] Nagios 1.2 starting... (PID=8639) > [1125887280] Finished daemonizing... (New PID=8640) What does happen when you start Nagios in the foreground, i.e. without -d switch, and strace it? Arno > -- > Robert > > > > ------------------------------------------------------- > SF.Net email is Sponsored by the Better Software Conference & EXPO > September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices > Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA > Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -- IT-Service Lehmann al at its-lehmann.de Arno Lehmann http://www.its-lehmann.de ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nagios_user at s1test1.it Mon Sep 5 16:40:43 2005 From: nagios_user at s1test1.it (nagios_user at s1test1.it) Date: Mon, 5 Sep 2005 16:40:43 +0200 Subject: nagios not running?? Message-ID: <20050905164043.7qgk65yj22roc0ok@webmail.s1test1.it> Hello, I have a little question to issue. I've upgrated my nagios server from 1.2 to 2.04b. I've tried to run nagios and I've received following message Whoops! Error: Could not read host and service status information! .... So, I've checked configuration with nagios -v configuration_file and it seem to be ok... ...have you any ideas? p.s. nagios running without error and I can see the PID Thank you for your help and sorry for my english. I hope I explain my problem... Best regards Rodolfo Greco ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From lists at webtent.net Mon Sep 5 17:08:42 2005 From: lists at webtent.net (Robert Fitzpatrick) Date: Mon, 05 Sep 2005 11:08:42 -0400 Subject: new install won't start In-Reply-To: <431BEE3D.2020608@its-lehmann.de> References: <1125890009.10655.5.camel@columbus.webtent.org> <431BEE3D.2020608@its-lehmann.de> Message-ID: <1125932922.19345.0.camel@columbus.webtent.org> On Mon, 2005-09-05 at 09:05 +0200, Arno Lehmann wrote: > Hello, > > Robert Fitzpatrick wrote: > > > I am lost. I am setup Nagios 1.2 many times. Now I have setup on my SuSE > > 9.3 box with rpm packages. All the files are there, I believe the > > permissions are fine, the configuration file checks out OK with '-v' and > > the log file just gives me the following. The lock file even has the > > PID, but no such PID running after starting. Any ideas... > > > > [1125887280] Nagios 1.2 starting... (PID=8639) > > [1125887280] Finished daemonizing... (New PID=8640) > > What does happen when you start Nagios in the foreground, i.e. without > -d switch, and strace it? Thanks, that got it going after getting all the perms right. The only problem I'm having is the CGI execution. I used to have Nagios running on this same machine and have all the backup files from that install. I have matched everything up to what is was before, but still getting a perms error when starting nagios, but it starts fine and writes the files it was complaining about. The init script is using 'daemon' as the user, if I change all my nagios perms to that user, the error goes away and it starts, but still no CGI capability. The old config had 'daemon' also in the init script and all worked fine. I have the user 'nagios' and group 'nagiocmd' in my nagios.cfg file. columbus:/var/spool/nagios # ls -la total 12 drwxrwsr-x 3 nagios nagios 4096 Sep 5 10:55 . drwxr-xr-x 17 root root 4096 Sep 4 17:36 .. drwxr-sr-x 2 nagios nagiocmd 4096 Sep 5 10:54 rw columbus:/var/spool/nagios # ls -la rw total 8 drwxr-sr-x 2 nagios nagiocmd 4096 Sep 5 10:54 . drwxrwsr-x 3 nagios nagios 4096 Sep 5 10:55 .. columbus:/var/spool/nagios # /etc/rc.d/nagios start Starting nagios touch: cannot touch `/var/log/nagios/nagios.log': Permission denied touch: cannot touch `/var/log/nagios/status.sav': Permission denied done columbus:/var/spool/nagios # ls -la total 32 drwxrwsr-x 3 nagios nagios 4096 Sep 5 10:55 . drwxr-xr-x 17 root root 4096 Sep 4 17:36 .. -rw-rw-r-- 1 nagios nagios 0 Sep 5 10:55 comment.log -rw-rw-r-- 1 nagios nagios 0 Sep 5 10:55 downtime.log -rw-r--r-- 1 root nagios 6 Sep 5 10:55 nagios.lock -rw-r--r-- 1 root nagios 101 Sep 5 10:55 nagios.log drwxr-sr-x 2 nagios nagiocmd 4096 Sep 5 10:55 rw -rw-rw-r-- 1 nagios nagios 11249 Sep 5 10:55 status.log columbus:/var/spool/nagios # ls -la rw total 8 drwxr-sr-x 2 nagios nagiocmd 4096 Sep 5 10:55 . drwxrwsr-x 3 nagios nagios 4096 Sep 5 10:59 .. prw-rw---- 1 nagios nagiocmd 0 Sep 5 10:55 nagios.cmd ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nagios_user at s1test1.it Mon Sep 5 17:31:53 2005 From: nagios_user at s1test1.it (nagios_user at s1test1.it) Date: Mon, 5 Sep 2005 17:31:53 +0200 Subject: nagios not running?? Message-ID: <20050905173153.fakf1i2l10akw8sw@webmail.s1test1.it> Hello, I have a little question to issue. I've upgrated my nagios server from 1.2 to 2.04b. I've tried to run nagios and I've received following message Whoops! Error: Could not read host and service status information! .... So, I've checked configuration with nagios -v configuration_file and it seem to be ok... ...have you any ideas? p.s. nagios running without error and I can see the PID Thank you for your help and sorry for my english. I hope I explain my problem... Best regards Rodolfo Greco ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From al at its-lehmann.de Mon Sep 5 20:22:38 2005 From: al at its-lehmann.de (Arno Lehmann) Date: Mon, 05 Sep 2005 20:22:38 +0200 Subject: new install won't start In-Reply-To: <1125932922.19345.0.camel@columbus.webtent.org> References: <1125890009.10655.5.camel@columbus.webtent.org> <431BEE3D.2020608@its-lehmann.de> <1125932922.19345.0.camel@columbus.webtent.org> Message-ID: <431C8CEE.1010701@its-lehmann.de> Hello, Robert Fitzpatrick wrote: > Thanks, that got it going after getting all the perms right. The only > problem I'm having is the CGI execution. I used to have Nagios running > on this same machine and have all the backup files from that install. I > have matched everything up to what is was before, but still getting a > perms error when starting nagios, but it starts fine and writes the > files it was complaining about. The init script is using 'daemon' as the > user, if I change all my nagios perms to that user, the error goes away > and it starts, but still no CGI capability. The old config had 'daemon' > also in the init script and all worked fine. I have the user 'nagios' > and group 'nagiocmd' in my nagios.cfg file. That looks like it might be a permissions problem with the web server user. Remember that the CGIs run as the same user you web server runs as. And between distributions that usually not the same, and it might even change from one version of the distribution to another. You should check what user and group your web server runs as, and probably add it to the nagiocmd group. There is some information in the manual. Arno > columbus:/var/spool/nagios # ls -la > total 12 > drwxrwsr-x 3 nagios nagios 4096 Sep 5 10:55 . > drwxr-xr-x 17 root root 4096 Sep 4 17:36 .. > drwxr-sr-x 2 nagios nagiocmd 4096 Sep 5 10:54 rw > columbus:/var/spool/nagios # ls -la rw > total 8 > drwxr-sr-x 2 nagios nagiocmd 4096 Sep 5 10:54 . > drwxrwsr-x 3 nagios nagios 4096 Sep 5 10:55 .. > columbus:/var/spool/nagios # /etc/rc.d/nagios start > Starting nagios touch: cannot touch `/var/log/nagios/nagios.log': > Permission denied > touch: cannot touch `/var/log/nagios/status.sav': Permission denied > > done > columbus:/var/spool/nagios # ls -la > total 32 > drwxrwsr-x 3 nagios nagios 4096 Sep 5 10:55 . > drwxr-xr-x 17 root root 4096 Sep 4 17:36 .. > -rw-rw-r-- 1 nagios nagios 0 Sep 5 10:55 comment.log > -rw-rw-r-- 1 nagios nagios 0 Sep 5 10:55 downtime.log > -rw-r--r-- 1 root nagios 6 Sep 5 10:55 nagios.lock > -rw-r--r-- 1 root nagios 101 Sep 5 10:55 nagios.log > drwxr-sr-x 2 nagios nagiocmd 4096 Sep 5 10:55 rw > -rw-rw-r-- 1 nagios nagios 11249 Sep 5 10:55 status.log > columbus:/var/spool/nagios # ls -la rw > total 8 > drwxr-sr-x 2 nagios nagiocmd 4096 Sep 5 10:55 . > drwxrwsr-x 3 nagios nagios 4096 Sep 5 10:59 .. > prw-rw---- 1 nagios nagiocmd 0 Sep 5 10:55 nagios.cmd > > -- IT-Service Lehmann al at its-lehmann.de Arno Lehmann http://www.its-lehmann.de ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From al at its-lehmann.de Mon Sep 5 20:24:26 2005 From: al at its-lehmann.de (Arno Lehmann) Date: Mon, 05 Sep 2005 20:24:26 +0200 Subject: nagios not running?? In-Reply-To: <20050905173153.fakf1i2l10akw8sw@webmail.s1test1.it> References: <20050905173153.fakf1i2l10akw8sw@webmail.s1test1.it> Message-ID: <431C8D5A.3010503@its-lehmann.de> Hi, nagios_user at s1test1.it wrote twice: > Hello, > I have a little question to issue. > I've upgrated my nagios server from 1.2 to 2.04b. > I've tried to run nagios and I've received following message > > Whoops! > > Error: Could not read host and service status information! > > .... Where do you receive that message? I assume in the web browser, right? > So, I've checked configuration with nagios -v configuration_file and it seem to > be ok... > ...have you any ideas? Yes. Check the web server / nagios CGI setup. As far as I recall, there is detailed information in the manual. Basically, make sure that the web server user can read the status information. > p.s. nagios running without error and I can see the PID > Thank you for your help and sorry for my english. I hope I explain my problem... Arno > Best regards > Rodolfo Greco > > > > ------------------------------------------------------- > SF.Net email is Sponsored by the Better Software Conference & EXPO > September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices > Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA > Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -- IT-Service Lehmann al at its-lehmann.de Arno Lehmann http://www.its-lehmann.de ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From forums at emat.be Mon Sep 5 22:38:26 2005 From: forums at emat.be (js) Date: Mon, 05 Sep 2005 22:38:26 +0200 Subject: Modbus over tcp/ip plugins Message-ID: <431CACC2.10300@emat.be> Hi all, Not really a nagios question, although. Is there someone with experience polling modbus over tcp/ip enabled devices on linux with nagios? Thanks J ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From niceforums at yahoo.com Tue Sep 6 10:21:57 2005 From: niceforums at yahoo.com (hamideh daliri) Date: Tue, 6 Sep 2005 01:21:57 -0700 (PDT) Subject: nagios works with SELinux , but it is silly ! Message-ID: <20050906082157.82786.qmail@web30111.mail.mud.yahoo.com> hi all i had well known problem with nagios on RHEL4 ( you know i mean SELinux and its policy .... ) , i have added these rules to /etc/selinux/targeted/src/policy/domains/program/apache.te : allow httpd_t usr_t:file {execute execute_no_trans}; allow httpd_t file_t:file {getattr execute read execute_no_trans}; and the nagios is now working,but it seems so silly to grant those permisions !!! would someone help to optimize rules above ? tnx. ______________________________________________________ Click here to donate to the Hurricane Katrina relief effort. http://store.yahoo.com/redcross-donate3/ ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From tylerawoods at gmail.com Tue Sep 6 17:14:50 2005 From: tylerawoods at gmail.com (Tyler Woods) Date: Tue, 06 Sep 2005 08:14:50 -0700 Subject: Forbidden Error Message-ID: <431DB26A.9000506@gmail.com> I've followed the directions exactly but get the following error when trying to view the site: / Forbidden You don't have permission to access /nagios/ on this server. Apache/2.0.46 (Red Hat) Server at docapp Port 443/ Please help. Have no idea why this is happening. ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From lyle at lcrcomputer.net Tue Sep 6 17:32:30 2005 From: lyle at lcrcomputer.net (Lyle Giese) Date: Tue, 06 Sep 2005 10:32:30 -0500 Subject: Forbidden Error In-Reply-To: <431DB26A.9000506@gmail.com> References: <431DB26A.9000506@gmail.com> Message-ID: <431DB68E.3010704@lcrcomputer.net> Tyler Woods wrote: > I've followed the directions exactly but get the following error when > trying to view the site: > / > Forbidden > You don't have permission to access /nagios/ on this server. > Apache/2.0.46 (Red Hat) Server at docapp Port 443/ > > Please help. Have no idea why this is happening. > You are trying to access Nagios via ssl(https). I don't think this is covered in the docs. Besides, this is basically a virtual server to Apache and requires the alias and access controls added to the basic system has to be added to the ssl sections in httpd.conf. And Apache 2.0.46 is not a current version. Hopefully you have it patched from Red Hat. Red Hat does not always change the version numbers when patching. Lyle ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From srunschke at abit.de Tue Sep 6 17:33:49 2005 From: srunschke at abit.de (srunschke at abit.de) Date: Tue, 6 Sep 2005 17:33:49 +0200 Subject: Antwort: Forbidden Error In-Reply-To: <431DB26A.9000506@gmail.com> References: <431DB26A.9000506@gmail.com> Message-ID: nagios-users-admin at lists.sourceforge.net schrieb am 06.09.2005 17:14:50: > I've followed the directions exactly but get the following error when > trying to view the site: > / > Forbidden > You don't have permission to access /nagios/ on this server. > Apache/2.0.46 (Red Hat) Server at docapp Port 443/ > > Please help. Have no idea why this is happening. Obviously an error in the apache configuration. Recheck permissions within httpd.conf and the file permissions. Looking at the apache errorlogs will help you loads too. Posting all that stuff here might even get you to a solution ;) regards sash -------------------------------------------------- Sascha Runschke Netzwerk Administration IT-Services ABIT AG Robert-Bosch-Str. 1 40668 Meerbusch Tel.:+49 (0) 2150.9153.226 Mobil:+49 (0) 173.5419665 mailto:SRunschke at abit.de http://www.abit.net http://www.abit-epos.net --------------------------------- Sicherheitshinweis zur E-Mail Kommunikation / Security note regarding email communication: http://www.abit.net/sicherheitshinweis.html ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From lyle at lcrcomputer.net Tue Sep 6 17:41:14 2005 From: lyle at lcrcomputer.net (Lyle Giese) Date: Tue, 06 Sep 2005 10:41:14 -0500 Subject: Access to status.log via a Perl script Message-ID: <431DB89A.7010807@lcrcomputer.net> I am working on building one of the traffic light project for Nagios. (specifically http://www.nagiosexchange.org/Wiki_Home.wiki.0.html?&tx_drwiki_pi1%5Bkeyword%5D=NagiosTLight ) I have one quick question for the experts here. It looks like Juilian got his status via status.cgi and parsed the resulting webpage that comes back. I have a small configuration and want to pull status from status.log file and then customize the status. I have most of it written. The question I have is about status.log. I am writting a small perl script and opening status.log read-only, read it into an array and promptly close it. Will that cause any problems with Nagios? What happens if Nagios wants to write to it at the same time this script is reading status.log? I know this solution does not scale well, but for what I am doing, it will never get that big. Thanks, Lyle Giese ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ae at op5.se Tue Sep 6 17:58:57 2005 From: ae at op5.se (Andreas Ericsson) Date: Tue, 06 Sep 2005 17:58:57 +0200 Subject: Access to status.log via a Perl script In-Reply-To: <431DB89A.7010807@lcrcomputer.net> References: <431DB89A.7010807@lcrcomputer.net> Message-ID: <431DBCC1.4090809@op5.se> Lyle Giese wrote: > I am working on building one of the traffic light project for Nagios. > (specifically > http://www.nagiosexchange.org/Wiki_Home.wiki.0.html?&tx_drwiki_pi1%5Bkeyword%5D=NagiosTLight > ) > > I have one quick question for the experts here. It looks like Juilian > got his status via status.cgi and parsed the resulting webpage that > comes back. I have a small configuration and want to pull status from > status.log file and then customize the status. I have most of it written. > > The question I have is about status.log. I am writting a small perl > script and opening status.log read-only, read it into an array and > promptly close it. Will that cause any problems with Nagios? What > happens if Nagios wants to write to it at the same time this script is > reading status.log? > You'll end up with either an incomplete read, or Nagios will beat you to it and write the data before you have time to read it. Nothing else should happen. If, on the other hand, you're using Nagios 2.0 you should be reading status.dat instead, since it contains the most recent info. That file is not appended to. Instead, nagios creates a temporary file and renames it when it's done writing. The GUI does exactly what you're suggesting, and no problems have been reported due to that (although the gui mmap()'s the file readonly rather than opening it and reading it. I'm not sure that option is available in perl). -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Lead Developer ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Mark.Law at thomson.com Tue Sep 6 18:29:54 2005 From: Mark.Law at thomson.com (Mark.Law at thomson.com) Date: Tue, 6 Sep 2005 12:29:54 -0400 Subject: check_nt configuration Message-ID: Can someone explain why the first definition fails and the second succeeds? I should note that the first definition only fails if the "service name" has spaces in it. define command{ command_name check_nc_net_with_l_d command_line $USER1$/check_nt -H $HOSTADDRESS$ -p 1248 -v $ARG1$ -l "service name" } define command{ command_name check_nc_net_with_l_d command_line $USER1$/check_nt -H hostname -p 1248 -v $ARG1$ -l "service name" } ________________________________ From: Anthony Montibello [mailto:amontibello at gmail.com] Sent: Saturday, September 03, 2005 4:42 AM To: Law, Mark (TSH Center); Nagios Users List Subject: Re: [Nagios-users] check_nt configuration I know the following works for NC_Net but I am not sure if it works for ns_client but I assume it should. add the " on the command definition again, command_line /usr/lib/nagios/plugins/check_nt -H $HOSTADDRESS$ -v SERVICESTATE -d SHOWALL -l "$ARG1$" because the check_command is delimited by the ! and can have spaces in it without the " thus: check_command check_nt!HUMMINGBIRD STR SERVICE I use the following in my config check_command check_nc_net_with_l_d!SERVICESTATE!MSSQL\$BKUPEXEC,Backup Exec Job Engine,Backup Exec Server!SHOWALL define command{ command_name check_nc_net_with_l_d command_line $USER1$/check_nt -H $HOSTADDRESS$ -p 1248 -v $ARG1$ -l "$ARG2$" -d $ARG3$ } I hope this helps Tony amontibello at shatterit.com On 9/2/05, Mark.Law at thomson.com wrote: I have a different problem with check_nt. >From the command line: /usr/lib/nagios/plugins/check_nt -H servername -p 1248 -v SERVICESTATE -d SHOWALL -l "HUMMINGBIRD STR SERVICE" works great and returns: HUMMINGBIRD STR SERVICE: Started just like it is supposed to do. However, when run from nagios it says: HUMMINGBIRD STR SERVICE: Unknown The check_nt command definition has been modified like this: command_line /usr/lib/nagios/plugins/check_nt -H $HOSTADDRESS$ -v SERVICESTATE -d SHOWALL -l $ARG1$ I removed the quotes from around $ARG1$ so that multiple services could be checked by passing with embedded quotes, like "server","service with space in the name", etc. This works great on one server running nagios but not the other. I have check and re-checked paths, permissions, check_nt versions and am stumped. Any help out there? Thanks, Mark ________________________________ From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users-admin at lists.sourceforge.net ] On Behalf Of Jon Sent: Thursday, August 25, 2005 2:46 PM To: Nagios-Users Subject: RE: [Nagios-users] check_nt configuration Hi, Since you mentioned that you have installed the plugins in a different location, did you make change to the $USER1$ variable to point it to the correct location? To set the variable $USER1$ to /usr/lib/nagios/plugins/ you need to edit the resource.cfg. You can also try executing the command to see if it actually works. Just go to where the check_nt is located and run it. That would eliminate a few things. HTH, Jon -------------- next part -------------- An HTML attachment was scrubbed... URL: From lyle at lcrcomputer.net Tue Sep 6 19:06:32 2005 From: lyle at lcrcomputer.net (Lyle Giese) Date: Tue, 06 Sep 2005 12:06:32 -0500 Subject: Access to status.log via a Perl script In-Reply-To: <431DBCC1.4090809@op5.se> References: <431DB89A.7010807@lcrcomputer.net> <431DBCC1.4090809@op5.se> Message-ID: <431DCC98.6030305@lcrcomputer.net> Andreas Ericsson wrote: > Lyle Giese wrote: > >> I am working on building one of the traffic light project for >> Nagios. (specifically >> http://www.nagiosexchange.org/Wiki_Home.wiki.0.html?&tx_drwiki_pi1%5Bkeyword%5D=NagiosTLight >> ) >> >> I have one quick question for the experts here. It looks like >> Juilian got his status via status.cgi and parsed the resulting >> webpage that comes back. I have a small configuration and want to >> pull status from status.log file and then customize the status. I >> have most of it written. >> >> The question I have is about status.log. I am writting a small perl >> script and opening status.log read-only, read it into an array and >> promptly close it. Will that cause any problems with Nagios? What >> happens if Nagios wants to write to it at the same time this script >> is reading status.log? >> > > You'll end up with either an incomplete read, or Nagios will beat you > to it and write the data before you have time to read it. Nothing else > should happen. > > If, on the other hand, you're using Nagios 2.0 you should be reading > status.dat instead, since it contains the most recent info. That file > is not appended to. Instead, nagios creates a temporary file and > renames it when it's done writing. The GUI does exactly what you're > suggesting, and no problems have been reported due to that (although > the gui mmap()'s the file readonly rather than opening it and reading > it. I'm not sure that option is available in perl). > Juilian's script seemed to only read the status of a host from what I could piece together. I wanted to customize the traffic light in that for instance on host A, if service http is down, I want that to be a red on the traffic light, but if ftp was down on the same host, that would rate a yellow. Basicaly, customizing the traffic light based on what the importance of a certain service is in the scheme of our operations. You seem to indicate that I can read the GUI in the same manner, but I was unable to find any real docs on status.cgi(I am currently running v1.1 of Nagios) to find the options I could pass to it to get back the info I want access to. And I am not a great programmer either. An incomplete read could cause the status to change to green and then on the next read back to yellow or red as the case maybe, but neither would be of that great of importance in the operations here. Thanks, Lyle ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ravikmrs at yahoo.com Tue Sep 6 19:13:11 2005 From: ravikmrs at yahoo.com (Ravi Kumar) Date: Tue, 6 Sep 2005 10:13:11 -0700 (PDT) Subject: rrd command Message-ID: <20050906171311.91915.qmail@web53901.mail.yahoo.com> Hi, What is command syntax of rrd to create graph for load or ping or etc ? thanks __________________________________________________ Do You Yahoo!? Tired of spam? Yahoo! Mail has the best spam protection around http://mail.yahoo.com -------------- next part -------------- An HTML attachment was scrubbed... URL: From marc at ena.com Tue Sep 6 20:22:15 2005 From: marc at ena.com (Marc Powell) Date: Tue, 6 Sep 2005 13:22:15 -0500 Subject: rrd command Message-ID: > -----Original Message----- > From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users- > admin at lists.sourceforge.net] On Behalf Of Ravi Kumar > Sent: Tuesday, September 06, 2005 12:13 PM > To: nagios-users at lists.sourceforge.net > Subject: [Nagios-users] rrd command > > Hi, > What is command syntax of rrd to create graph for load or ping or etc ? > thanks /path/to/rrdtool create http://people.ee.ethz.ch/~oetiker/webtools/rrdtool/doc/rrdcreate.en.html The RRDTool Tutorial will probably be useful to you -- http://people.ee.ethz.ch/~oetiker/webtools/rrdtool/tut/rrdtutorial.en.ht ml Unless you can be more specific about what you're trying to do, how you're trying to do it and how it relates to Nagios, that's about as specific as we can get for an answer. -- Marc ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jsmforum at optonline.net Tue Sep 6 21:44:17 2005 From: jsmforum at optonline.net (Jeff) Date: Tue, 6 Sep 2005 15:44:17 -0400 Subject: Perl plugin exit code? In-Reply-To: References: Message-ID: Hey all, I've written a perl script to check the status of some information from one of our application databases. It seems to run fine manually but nagios only get's (No Output!) when I try to use it live. Permissions on the perl script are set correctly, if I su - over to nagios I can run the script as nagios. So I'm stumped and looking for ideas here. Second part of my question has to do with perl more than nagios, Obviously I need to send the correct exit code to nagios. I set a varialble "$status" (0, 1 or 2) in the script according to the query results and at the end of the script I have Exit $status; Will this give the correct exit status that nagios is looking for? Thanks, Jeff ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From teng at dataway.com Tue Sep 6 22:17:26 2005 From: teng at dataway.com (Tedman Eng) Date: Tue, 6 Sep 2005 13:17:26 -0700 Subject: Perl plugin exit code? Message-ID: <37ED92F9890FAF4BB947613C66FF8B1A08BB2B84@dw-mail.dataway.com> > Second part of my question has to do with perl more than nagios, > Obviously I need to send the correct exit code to nagios. I set a > varialble "$status" (0, 1 or 2) in the script according to the query > results and at the end of the script I have > > Exit $status; > The command will not work unless "exit" is lower case. This may be so in your script, but wasn't in your email. (it won't hurt to look again to make sure) $ perl -e '$status=0;exit $status'; echo $? 0 $ perl -e '$status=2;Exit $status'; echo $? Can't call method "Exit" without a package or object reference at -e line 1. 255 ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From td3201 at gmail.com Tue Sep 6 16:46:38 2005 From: td3201 at gmail.com (Terry) Date: Tue, 6 Sep 2005 09:46:38 -0500 Subject: fork errors In-Reply-To: <20050902201629.7889.qmail@web31906.mail.mud.yahoo.com> References: <8ee0610105090207415fef2316@mail.gmail.com> <20050902201629.7889.qmail@web31906.mail.mud.yahoo.com> Message-ID: <8ee061010509060746544edf47@mail.gmail.com> I haven't tried the fork script but I did try starting nagios under strace as described but the main process which strace is tracing appears to exit after spawning children. Here is the last snippet so you know what I mean: [pid 29599] clone(Process 29644 attached [pid 29643] <... clone resumed> child_stack=0, flags=CLONE_CHILD_CLEARTID|CLONE_CHILD_SETTID|SIGCHLD, child_tidptr=0xb75e50c8) = 29644 [tcb table full] [pid 29599] <... clone resumed> child_stack=0, flags=CLONE_CHILD_CLEARTID|CLONE_CHILD_SETTID|SIGCHLD, child_tidptr=0xb75e50c8) = 29645 [tcb table full] Process 29641 detached Process 29599 detached Process 29632 detached Process 29640 detached Process 29635 detached Process 29644 detached Process 29643 detached Any other ideas? On 9/2/05, Fred wrote: > > Just for fun, you might try creating the problem and see how many forks > you *can* get, for example: > #!/usr/bin/perl > > my $c=0; > do { > my $pid = fork(); > if ($pid) > { > $c++; > print "\rchildcount $c "; > } > else > { > sleep(1); > exit(0); > } > } while 1; > to create as many procs as you can and test your limit. You would > want to do this under the same environment as the nagios process > runs. > They will all be kept defunct until the process exits (when you > hit the max processes you can create) > The other thing you might try is to start nagios under > strace -f and output the data to a log. You can specify > just forks for strace, i.e., strace -f -e trace=process >/tmp/,log 2>&1 > nagios .... > That would give you a good handle on what is going on when the failure > occurs. Might slow nagios down a bit, but probably nothing significant. > -FredC > > > *Terry * wrote: > > I have a program that checks the logs by the minute and pages when the > fork errors occur, so we are responding within minutes. I have looked > at the resources every time it happens and we have plenty of > resources. Is there a single plugin I can put into debugging mode so > that when this happens I get more information as to why it is giving > these errors? Here are a few facts: > - the system is fine with memory all the time, never runs out > (resident/paging) > - there are not an unusual amount of processes running, maybe around > 200 at a time, but no where near the ulimit setting > - ulimit for the 'nagios' user matches that of root (unlimited). here > is the ulimit: > core file size (blocks, -c) 0 > data seg size (kbytes, -d) unlimited > file size (blocks, -f) unlimited > max locked memory (kbytes, -l) 4 > max memory size (kbytes, -m) unlimited > open files (-n) 1024 > pipe size (512 bytes, -p) 8 > stack size (kbytes, -s) 10240 > cpu time (seconds, -t) unlimited > max user processes (-u) 7168 > virtual memory (kbytes, -v) unlimited > > Thanks, > Terry > > > > On 9/1/05, Fred wrote: > > My guess would be to look at your resource utilization on your system, > > most likely causes for fork() to fail are no more process slots, out of > > memory, or past some kind of per-user (non-root) limit. When this > > occurs look at your system logs, ps output and see if you have *lots* > > of processes hanging around. It could be that nagios has stopped reaping > > its children (or another unrelated process has sucked up the resources) > > and you have simply pushed your system to the edge. It might be that you > > get to that situation and it backs off before you even notice it and you > > are left with nagios having problems dealing with the aftermath. > > > > -FredC > > > > --- Terry wrote: > > > > > Hello, > > > > > > I have been having this issue for quite some time. For some unknown > > > reason, nagios stops performing checks with these errors: > > > > > > [1125536952] Warning: The check of service 'PING' on host 'hostname' > > > could not be performed due to a fork() error. The check will be > > > rescheduled. > > > > > > All checks fail like this until nagios is restarted. When this > > > problem is occuring I can run the service checks manually both as the > > > nagios user and as the root user. There are no resource problems that > > > I can see at the time. We do not appear to be hitting a limit with > > > open files or anything like that either. The nagios mirrors the root > > > user in that area. > > > > > > What could be wrong? > > > > > > Thanks! > > > > > > > > > ------------------------------------------------------- > > > SF.Net email is Sponsored by the Better Software Conference & EXPO > > > September 19-22, 2005 * San Francisco, CA * Development Lifecycle > Practices > > > Agile & Plan-Driven Development * Managing Projects & Teams * Testing > & QA > > > Security * Process Improvement & Measurement * > http://www.sqe.com/bsce5sf > > > _______________________________________________ > > > Nagios-users mailing list > > > Nagios-users at lists.sourceforge.net > > > https://lists.sourceforge.net/lists/listinfo/nagios-users > > > ::: Please include Nagios version, plugin version (-v) and OS when > reporting > > > any issue. > > > ::: Messages without supporting info will risk being sent to /dev/null > > > > > > > > > > > > > > > > > > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From jsmforum at optonline.net Tue Sep 6 22:24:22 2005 From: jsmforum at optonline.net (Jeff) Date: Tue, 6 Sep 2005 16:24:22 -0400 Subject: Perl plugin exit code? In-Reply-To: References: Message-ID: > -----Original Message----- > From: nagios-users-admin at lists.sourceforge.net > [mailto:nagios-users-admin at lists.sourceforge.net] On Behalf > Of Tedman Eng > Sent: Tuesday, September 06, 2005 16:17 > To: nagios-users at lists.sourceforge.net > Cc: 'Jeff' > Subject: RE: [Nagios-users] Perl plugin exit code? > > > > > Second part of my question has to do with perl more than nagios, > > Obviously I need to send the correct exit code to nagios. I set a > > varialble "$status" (0, 1 or 2) in the script according to > the query > > results and at the end of the script I have > > > > Exit $status; > > > > The command will not work unless "exit" is lower case. > This may be so in your script, but wasn't in your email. > (it won't hurt to look again to make sure) > > > $ perl -e '$status=0;exit $status'; echo $? > 0 > > $ perl -e '$status=2;Exit $status'; echo $? > Can't call method "Exit" without a package or object > reference at -e line 1. 255 > That's not the problem, it is lowercase in my script. My email client capitolized it because it was on it's own line. Thanks though.... Jeff ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Tue Sep 6 23:06:56 2005 From: marc at ena.com (Marc Powell) Date: Tue, 6 Sep 2005 16:06:56 -0500 Subject: Perl plugin exit code? Message-ID: > -----Original Message----- > From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users- > admin at lists.sourceforge.net] On Behalf Of Jeff > Sent: Tuesday, September 06, 2005 2:44 PM > To: nagios-users at lists.sourceforge.net > Subject: [Nagios-users] Perl plugin exit code? > > Hey all, > > I've written a perl script to check the status of some information from > one of our application databases. It seems to run fine manually but > nagios only get's (No Output!) when I try to use it live. > > Permissions on the perl script are set correctly, if I su - over to > nagios I can run the script as nagios. So I'm stumped and looking for > ideas here. Does your plugin output one and only one line of text? http://nagiosplug.sourceforge.net/developer-guidelines.html#PLUGOUTPUT > > Second part of my question has to do with perl more than nagios, > Obviously I need to send the correct exit code to nagios. I set a > varialble "$status" (0, 1 or 2) in the script according to the query > results and at the end of the script I have > > Exit $status; > > Will this give the correct exit status that nagios is looking for? Yes, do you not see the status being set properly? It's probably better to import utils.pm and exit using the name just in case the exit codes change in the future -- #! /usr/bin/perl -w use strict; use lib "/usr/local/nagios/libexec" ; use utils qw(%ERRORS &print_revision &support &usage); then you can print ""; exit $ERRORS{'OK'}; # or exit $ERRORS{'WARNING'}; # or exit $ERRORS{'CRITICAL'}; # or exit $ERRORS{'UNKNOWN'}; -- Marc ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From potus98 at yahoo.com Tue Sep 6 23:09:23 2005 From: potus98 at yahoo.com (John Christian) Date: Tue, 6 Sep 2005 14:09:23 -0700 (PDT) Subject: Status Info = Warning, but Status = Unknown Message-ID: <20050906210923.82179.qmail@web54710.mail.yahoo.com> Hello, I'm looking at the "Service Detail" section on the web GUI. Most hosts/services are working fine, but a few services seem confused. Their Status Information (7th column) is "WARNING [details...]" or "OK [details...]" but their Status (3rd column) is still listed as "UNKNOWN". Attempts to fix that did not: Restarted Nagios mutliple times. Removed host reference and re-added. Waited a few hours for Nagios to settle down. The problem *seems* to be more common when using Jeff Scott's check_load_remote 1.1 or check_uptime_remote 1.1 in combination with the remote host running Sun_SSH_1.0.1. Sometimes they will clear-up (display statuseses that make sense) but usually they're in status UNKNOWN even though the status information shows WARNING or OK. The same scripts used against hosts running OpenSSH_4.1 are always fine. Why is Nagios displaying conflicting information? How do I force Nagios to 'forget' everything about a host and start fresh? Other tips? Nagios 2.0b4 SunOS 5.9 Generic_112233-12 Sun-Fire-880 TIA for any help! -John ______________________________________________________ Click here to donate to the Hurricane Katrina relief effort. http://store.yahoo.com/redcross-donate3/ ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Tue Sep 6 23:34:44 2005 From: marc at ena.com (Marc Powell) Date: Tue, 6 Sep 2005 16:34:44 -0500 Subject: Status Info = Warning, but Status = Unknown Message-ID: > -----Original Message----- > From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users- > admin at lists.sourceforge.net] On Behalf Of John Christian > Sent: Tuesday, September 06, 2005 4:09 PM > To: nagios-users at lists.sourceforge.net > Subject: [Nagios-users] Status Info = Warning, but Status = Unknown > > Hello, > > I'm looking at the "Service Detail" section on the web > GUI. Most hosts/services are working fine, but a few > services seem confused. Their Status Information (7th > column) is "WARNING [details...]" or "OK [details...]" > but their Status (3rd column) is still listed as > "UNKNOWN". Nagios determines the Status (3rd column) from the plugin exit code _only_. The Status Information is just an explanation for human readability. It would appear that the plugins you are using are exiting with a different status code than the plugin output would suggest they should be. > The problem *seems* to be more common when using Jeff > Scott's check_load_remote 1.1 or check_uptime_remote > 1.1 in combination with the remote host running > Sun_SSH_1.0.1. Sometimes they will clear-up (display > statuseses that make sense) but usually they're in > status UNKNOWN even though the status information > shows WARNING or OK. > > The same scripts used against hosts running > OpenSSH_4.1 are always fine. Is this a hint that you're executing these plugins via check_by_ssh? Perhaps check_by_ssh isn't passing the exit code back properly or more likely it's encountering problems itself and you're seeing it's exit code. Have you tried running the commands by hand as the nagios user exactly as they are defined from your central machine and verify the exit code (echo $?)? > > Why is Nagios displaying conflicting information? The plugin exit code (very important) doesn't agree with the plugin output (not important, at least to nagios). The 'plugin' in your case could be check_load_remote or check_by_ssh if that's what you're using. > How do I force Nagios to 'forget' everything about a > host and start fresh? If you're not using state retention, restart nagios. If you're using state retention, stop nagios, remove the retention file and restart nagios. > Other tips? Check your sshd log on the remote host for errors. Enable sshd debug mode. -- Marc ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ae at op5.se Wed Sep 7 00:02:55 2005 From: ae at op5.se (Andreas Ericsson) Date: Wed, 07 Sep 2005 00:02:55 +0200 Subject: Perl plugin exit code? In-Reply-To: References: Message-ID: <431E120F.4090306@op5.se> Jeff wrote: > Hey all, > > I've written a perl script to check the status of some information from > one of our application databases. It seems to run fine manually but > nagios only get's (No Output!) when I try to use it live. > > Permissions on the perl script are set correctly, if I su - over to > nagios I can run the script as nagios. So I'm stumped and looking for > ideas here. > Are you running Nagios with the embedded perl interpreter turned on? If so, try re-compiling without it, or make sure the mini-epn can execute the script. This can (in part) be assured by making it run without warnings with the strict pragma and the -wT options to the hashbang line. > Second part of my question has to do with perl more than nagios, > Obviously I need to send the correct exit code to nagios. I set a > varialble "$status" (0, 1 or 2) in the script according to the query > results and at the end of the script I have > > Exit $status; > > Will this give the correct exit status that nagios is looking for? > Yes. -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Lead Developer ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Ludwig.Pummer at Copart.Com Wed Sep 7 00:29:56 2005 From: Ludwig.Pummer at Copart.Com (Ludwig Pummer) Date: Tue, 06 Sep 2005 15:29:56 -0700 Subject: oscp_command never runs Message-ID: <1126045796.24890.22.camel@bender> Let me start off by saying I've already searched the mailing list for this issue and checked for issues mentioned in those messages. Of course, it's possible that I didn't find (and therefore didn't check for) the issue which is causing my problem... I'm running Nagios 1.2 compiled from source package on FreeBSD. I'm experimenting with setting up distributed + failover monitoring, so my configuration is fairly small and clean. Right now, I'm trying to make 'boba' send its service check results to 'jango' using the ocsp_command. My nagios.cfg has: ocsp_command=submit_check_result_smart ocsp_timeout=5 obsess_over_services=1 use_retained_program_state=0 I've also turned on all logging options except the initial state option. My services.cfg template for all of the services has: obsess_over_service 1 my misccommands.cfg has: define command{ command_name submit_check_result_smart command_line /usr/local/nagios/libexec/eventhandlers/submit_check_result_smart $HOSTNAME$ '$SERVICEDESC$' $SERVICESTATE$ '$OUTPUT$'` } I have 4 hosts and 6 services configured (1 PING on each host, and NRPE check_nagios on 2 of the 4 hosts) and they've been running fine. My event handler does a lookup on the hostname passed to it and then calls send_nsca. It also writes a line to its own log file with the arguments passed to it. If I manually run this event handler as the nagios user: root at boba# su -l nagios /usr/local/nagios/libexec/eventhandlers/submit_check_result_smart jango 'PING' OK 'test ping time 0.4ms' 1 data packet(s) sent to host successfully. root at boba# My event handler's log after the above manual run: submit_check_result_smart called with 1:jango 2:PING 3:OK 4:test ping time 0.4ms, found;1, return_code:0 On host jango, in nagios.log, I see: EXTERNAL COMMAND: PROCESS_SERVICE_CHECK_RESULT;jango;PING;0;test ping time 0.4ms Nagios doesn't find any errors in my config (just 1 warning about a contact not belonging to a contact group), and I've stopped it, removed the status.sav file, and restarted it. The CGI interface's Process Info page on boba says that Obessing is enabled. I see nothing in my nagios.log about the ocsp_command being run, and my submit_check_result_smart script's log file never shows that the command was run. Anyone have any ideas why the ocsp_command is not being executed? --Ludwig Pummer ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From chector at Currenex.com Wed Sep 7 01:05:25 2005 From: chector at Currenex.com (Chris Hector) Date: Tue, 06 Sep 2005 16:05:25 -0700 Subject: Odd behavior adding passive service In-Reply-To: References: Message-ID: <431E20B5.8090002@currenex.com> My goal is to add a service to an existing active nagios implementation that will catch traps and send email alerts based on severity. I've got the SNMP part working fine but am having trouble with the nagios portion. I've added a service definition similar to: # Warning SNMP traps define service{ service_description TRAPWARN hostgroup_name network-devices use generic-service is_volatile 1 active_checks_enabled 0 passive_checks_enabled 1 max_check_attempts 1 normal_check_interval 31536000 notification_interval 0 notification_options c check_command passive_check_ok } This works fine for the traps but causes nagios to schedule the active checks based on the normal_check_interval here even though it is explicitely defined in the generic-service definition, it schedules all the active checks to run one year later instead of the 3 minutes that was defined in the template eg. define service{ name generic-service ... normal_check_interval 3 } When I remove the normal_check_interval the behavior is as expected. What I want is an initial passive check based on some very small script to set it as ok instead of pending using an active check, then set the normal_check_interval to some very long period of time so it isn't run again for say a month or year after, and handle passive trap handling normally I know an alternative is to disable the active check and simple send a submit_check_result for each IP that I'm monitoring but I was going to use this for additional functionality later. ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From potus98 at yahoo.com Wed Sep 7 05:11:06 2005 From: potus98 at yahoo.com (John Christian) Date: Tue, 6 Sep 2005 20:11:06 -0700 (PDT) Subject: Status Info = Warning, but Status = Unknown In-Reply-To: References: Message-ID: <20050907031106.50147.qmail@web54703.mail.yahoo.com> For posterior's sake :-) On the monitored hosts where I was having problems, I have upgraded SSH from Sun's SSH to OpenSSH. This seems to have fixed the problem I was having. I ran some cursory tests of the scripts against two hosts (one using Sun's SSH and the other using OpenSSH). Checking the exit statuses did not provide any clues. I did not test extensively since upgrading to OpenSSH (which I'm doing on all hosts anyways) solved my problem. I was not using check_by_ssh. These scripts seem to handle the ssh-connectivity on their own. HTH a future archive searcher! -John --- Marc Powell wrote: > > > > -----Original Message----- > > From: nagios-users-admin at lists.sourceforge.net > [mailto:nagios-users- > > admin at lists.sourceforge.net] On Behalf Of John > Christian > > Sent: Tuesday, September 06, 2005 4:09 PM > > To: nagios-users at lists.sourceforge.net > > Subject: [Nagios-users] Status Info = Warning, but > Status = Unknown > > > > Hello, > > > > I'm looking at the "Service Detail" section on the > web > > GUI. Most hosts/services are working fine, but a > few > > services seem confused. Their Status Information > (7th > > column) is "WARNING [details...]" or "OK > [details...]" > > but their Status (3rd column) is still listed as > > "UNKNOWN". > > Nagios determines the Status (3rd column) from the > plugin exit code > _only_. The Status Information is just an > explanation for human > readability. It would appear that the plugins you > are using are exiting > with a different status code than the plugin output > would suggest they > should be. > > > The problem *seems* to be more common when using > Jeff > > Scott's check_load_remote 1.1 or > check_uptime_remote > > 1.1 in combination with the remote host running > > Sun_SSH_1.0.1. Sometimes they will clear-up > (display > > statuseses that make sense) but usually they're in > > status UNKNOWN even though the status information > > shows WARNING or OK. > > > > The same scripts used against hosts running > > OpenSSH_4.1 are always fine. > > Is this a hint that you're executing these plugins > via check_by_ssh? > Perhaps check_by_ssh isn't passing the exit code > back properly or more > likely it's encountering problems itself and you're > seeing it's exit > code. Have you tried running the commands by hand as > the nagios user > exactly as they are defined from your central > machine and verify the > exit code (echo $?)? > > > > > Why is Nagios displaying conflicting information? > > The plugin exit code (very important) doesn't agree > with the plugin > output (not important, at least to nagios). The > 'plugin' in your case > could be check_load_remote or check_by_ssh if that's > what you're using. > > > How do I force Nagios to 'forget' everything about > a > > host and start fresh? > > If you're not using state retention, restart nagios. > If you're using > state retention, stop nagios, remove the retention > file and restart > nagios. > > > Other tips? > > Check your sshd log on the remote host for errors. > Enable sshd debug > mode. > > -- > Marc > > > ------------------------------------------------------- > SF.Net email is Sponsored by the Better Software > Conference & EXPO > September 19-22, 2005 * San Francisco, CA * > Development Lifecycle Practices > Agile & Plan-Driven Development * Managing Projects > & Teams * Testing & QA > Security * Process Improvement & Measurement * > http://www.sqe.com/bsce5sf > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version > (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being > sent to /dev/null > ______________________________________________________ Click here to donate to the Hurricane Katrina relief effort. http://store.yahoo.com/redcross-donate3/ ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From f1216 at yahoo.com Wed Sep 7 05:47:21 2005 From: f1216 at yahoo.com (Fred) Date: Tue, 6 Sep 2005 20:47:21 -0700 (PDT) Subject: Odd behavior adding passive service In-Reply-To: <431E20B5.8090002@currenex.com> References: <431E20B5.8090002@currenex.com> Message-ID: <20050907034721.54270.qmail@web31903.mail.mud.yahoo.com> I had tried something similar, basically, a passive check definition for some static information such as memory size, # of processors, physical location, etc stuff that doesn't change. My attempt was to define the service in nagios and then when I start the nagios service (service nagios start) I wait till nagios is up and ready then run my plug-in from the service start script to populate the passive info. That worked, sortof. The problem was that once I do this on a larger system where I configure distributed monitoring the services timeout and go stale, then I lose the data. I wound up creating an active service on one node that runs the plug-in that populates the data for the other nodes. I wind up scheduling this active service to run once a day ... even though the data is static and will never change or result in a non-OK status. It does allow for nodes to come and go and not have to restart nagios to see the changes, although. Since you mention submit_check_result, I'm assuming you are setting up some kind of distributed monitoring setup. It would be a nice enhancement to allow a hook when a node comes back up after being down (via host-check), as you could then re-run any static services such as this. --- Chris Hector wrote: > My goal is to add a service to an existing active nagios implementation > that will catch traps and send email alerts based on severity. I've got > the SNMP part working fine but am having trouble with the nagios portion. > > I've added a service definition similar to: > > # Warning SNMP traps > define service{ > service_description TRAPWARN > hostgroup_name network-devices > use generic-service > is_volatile 1 > active_checks_enabled 0 > passive_checks_enabled 1 > max_check_attempts 1 > normal_check_interval 31536000 > notification_interval 0 > notification_options c > check_command passive_check_ok > } > > This works fine for the traps but causes nagios to schedule the active > checks based on the normal_check_interval here even though it is > explicitely defined in the generic-service definition, it schedules all > the active checks to run one year later instead of the 3 minutes that > was defined in the template > > eg. > define service{ > name generic-service > ... > normal_check_interval 3 > } > > When I remove the normal_check_interval the behavior is as expected. > > What I want is an initial passive check based on some very small script > to set it as ok instead of pending using an active check, then set the > normal_check_interval to some very long period of time so it isn't run > again for say a month or year after, and handle passive trap handling > normally > > I know an alternative is to disable the active check and simple send a > submit_check_result for each IP that I'm monitoring but I was going to > use this for additional functionality later. > > > ------------------------------------------------------- > SF.Net email is Sponsored by the Better Software Conference & EXPO > September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices > Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA > Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting > any issue. > ::: Messages without supporting info will risk being sent to /dev/null > ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jjk_saji at yahoo.com Wed Sep 7 07:35:46 2005 From: jjk_saji at yahoo.com (John Joseph) Date: Wed, 7 Sep 2005 06:35:46 +0100 (BST) Subject: Not able to get mail notification , Guidance requested Message-ID: <20050907053546.71558.qmail@web40810.mail.yahoo.com> Hi team I have RHEL4 , nagios 1.2 , postfix , in which MTA is working fine I had made contact , contact group , on contact details , I had given ?host-notify-by-email and notify-by-email ? for host_notification_ command and service_notification_command and for that host I have defined host " joseph" as Host "joseph" Name : joseph Alias : Joseph-TEST Address : 192.168.20.99 # Parents : Router-HO # Host Groups : HO-Server Check_command : Max_check_attempts : 3 Checks_enabled : Yes Event_handler_enabled : Nothing Event_handler : Low_flap_threshold : 0 % High_flap_threshold : 0 % Flap_detection_enabled : Nothing Process_perf_data : Nothing Retain_status_information : Yes Retain_nonstatus_information : Yes Notification_interval : 3 * 60 sec Notification_period : 24x7 Notification_options : d,u,r Notifications_enabled : Yes Stalking_options : o,d,u Status : Enabled For Services I have defined FTP as Host name : joseph Description : FTP Check Is Volatile : Nothing # Service Groups : ForServieGroup Check_command : check_ftp Check_command_arguments : Max_check_attempts : 3 Normal_check_interval : 3 * 60 sec Retry_check_interval : 3 * 60 sec Active_checks_enabled : Nothing Passive_checks_enabled : Nothing Check_period : 24x7 Parallelize_check : Nothing Obsess_over_service : Nothing Check_freshness : Nothing Freshness treshold : 0 sec Event_handler : Event_handler_arguments : Event_handler enabled : Nothing Low flap treshold : 0 % High flap treshold : 0 % Flap_detection_enabled : Nothing Process_perf_data : Nothing Retain_status_information : Nothing Retain_nonstatus_information : Nothing Notification_interval : 3 * 60 sec Notification_period : 24x7 Notification_options : w,u,c,r Notification_enabled : Yes # Contact Groups : IT-Support Stalking_options : o,w,u,c Status : Enabled I am not able to get the mail notification , when there is change for the ftp ie when FTP is stopped or start Please Guide me Thanks Joseph John ******************* My Stupid Notes http://geocities.com/jjk_saji/ ******************** ___________________________________________________________ To help you stay safe and secure online, we've developed the all new Yahoo! Security Centre. http://uk.security.yahoo.com ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mailinglist-nagios at taos-it.nl Wed Sep 7 09:01:38 2005 From: mailinglist-nagios at taos-it.nl (Maurice Lucas) Date: Wed, 7 Sep 2005 09:01:38 +0200 Subject: Problems with the check_ping plugin Message-ID: <017b01c5b379$fe02ac90$0364a8c0@mmid.local> Hello, I use the ping plugin with the following command check_command check_ping!100.0,20%!400.0,60% But I receive a lot of warning/failures with the following text in the email ***** Nagios ***** Notification Type: PROBLEM Service: PING Host: Default Gateway Address: MUNGED State: WARNING Date/Time: Wed Sept 7 08:20:32 CEST 2005 Additional Info: PING WARNING - Packet loss = 0%, RTA = 1.11 ms What could be the reason for this mailing? packet loss is 0 and the RTA is far below the thresholds. With kind regards, Met vriendelijke groet, Maurice Lucas TAOS-IT ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ae at op5.se Wed Sep 7 10:10:35 2005 From: ae at op5.se (Andreas Ericsson) Date: Wed, 07 Sep 2005 10:10:35 +0200 Subject: Problems with the check_ping plugin In-Reply-To: <017b01c5b379$fe02ac90$0364a8c0@mmid.local> References: <017b01c5b379$fe02ac90$0364a8c0@mmid.local> Message-ID: <431EA07B.8010302@op5.se> Maurice Lucas wrote: > Hello, > > I use the ping plugin with the following command > check_command check_ping!100.0,20%!400.0,60% > > But I receive a lot of warning/failures with the following text in the > email > > ***** Nagios ***** > > Notification Type: PROBLEM > > Service: PING > Host: Default Gateway > Address: MUNGED > State: WARNING > > Date/Time: Wed Sept 7 08:20:32 CEST 2005 > > Additional Info: > > PING WARNING - Packet loss = 0%, RTA = 1.11 ms > > What could be the reason for this mailing? > packet loss is 0 and the RTA is far below the thresholds. > The /bin/ping (or equivalent) might send text to stdout. This isn't handled gracefully in check_ping. Try check_icmp instead. It has its own ICMP engine, so you won't see any of those weird errors from it. > > With kind regards, > Met vriendelijke groet, > > Maurice Lucas > TAOS-IT > > > > ------------------------------------------------------- > SF.Net email is Sponsored by the Better Software Conference & EXPO > September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices > Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA > Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. ::: Messages without supporting info will risk > being sent to /dev/null > -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Lead Developer ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From zoltan.arpadffy at essnet.se Wed Sep 7 15:25:06 2005 From: zoltan.arpadffy at essnet.se (Zoltan Arpadffy) Date: Wed, 7 Sep 2005 15:25:06 +0200 Subject: FW: servicegroups issue Message-ID: <362C9D444B0E0E409B516EBFAE5229F80366127A@HELIOS.essnet.se> hi, I am very new to nagios and try to explore as much I can appreciate with limited knowledge. I use 2.04b and it works perfect. Specially the hostgroups feature that it is possible to define services for entire hostgroup is fantastic. It makes possible monitor whole network with reasonable small config files. What I miss is a possibility to define servicegroups with hostgroup members. For example: define hostgroup{ hostgroup_name polarhome alias Polarhome Servers members redhat,debian,freebsd,netbsd,openbsd,vax,alpha,qnx,tru64,qnx,solaris,hpu x,irix,aix } define service{ use generic-service hostgroup_name polarhome service_description SMTP contact_groups polarhome-admins check_command check_smtp Now I would like to define (and it would be very useful to see) a servicegroup like: #define servicegroup{ servicegroup_name smtp-services alias SMTP Services members polarhome,SMTP,gate,SMTP } Unfortunately, this is not possible: Error: Could not find a service matching host name 'polarhome' and description 'SMTP' Error: Could not expand member services specified in servicegroup (config file '/usr/local/nagios/etc/servicegroups.cfg', starting on line 1) Am I doing something wrong - or this is not implemented at all. Thank you very much in advance. Best regards, Z -------------- next part -------------- An HTML attachment was scrubbed... URL: From m.borsani at it.net Wed Sep 7 15:40:35 2005 From: m.borsani at it.net (Marco Borsani) Date: Wed, 7 Sep 2005 15:40:35 +0200 Subject: Always a WARNING state from check_snmp Message-ID: Hi all! I am testing check_snmp on a Cisco FW. All it seems to be OK when I checking Memory or Connections, but when I checking CPU (for istance cpmCPUTotal5sec) I always receive a WARNING state. You can see (in the follwing line command) that I don't mention any warning/critial range (!) #> ./check_snmp -H HOSTADDRESS -o .1.3.6.1.4.1.9.9.109.1.1.1.1.3.1 -C public #> SNMP WARNING - 9 Regards Marco Borsani Unix & Monitoring System Administrator Technical Operation Tel. +39 010 4310115 Fax +39 010 4327454 E-mail: m.borsani at IT.net ITnet S.r.l. - Direzione e Coordinamento di WIND Telecomunicazioni S.p.A. Internet Service Provider Sede legale: Via C.G.Viola, 48 - 00148 Roma Dir. Centrale e Amministrativa: Via Pacinotti, 39 16151 Genova (Italy) http://www.it.net mailto:info at IT.net _______________________________________________________________ Altre sedi ITnet: MILANO tel.: +39 02 30114900 info-milano at IT.net ROMA tel.: +39 06 83116707 info-roma at IT.net _______________________________________________________________ ITnet is associated to CIX (Commercial IP eXchange) and RIPE ITnet is associated to AIIP (Associazione Italiana Internet Providers) ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ktynagios at richard-group.com Wed Sep 7 16:13:18 2005 From: ktynagios at richard-group.com (Kurt Yoder) Date: Wed, 7 Sep 2005 10:13:18 -0400 Subject: log monitoring? Message-ID: <49bd602560f771a4717a7ca957337607@richard-group.com> Hello list Has anyone tried using nagios to monitor log files? I'm thinking specifically about the types of things that programs like logcheck report. I looked around on Google, but didn't see anything specifically mentioning this type of monitoring. -- Kurt Yoder ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From colet at code-energy.com Wed Sep 7 16:23:19 2005 From: colet at code-energy.com (Cole Tuininga) Date: Wed, 07 Sep 2005 10:23:19 -0400 Subject: Ethernet interface selection Message-ID: <1126102999.747.9.camel@localhost> Hi all - I have a question that will probably have a quick answer either way. I have a system with two public ethernet interfaces running Nagios 1.2. Is there a simple way to make sure that all checks for external systems go through one particular interface? Thanks in advance... -- Give a person a fish and you feed them for a day. Give a person access to the net and they won't bother you for weeks. -- Internet proverb Cole Tuininga Lead Developer Code Energy, Inc colet at code-energy.com PGP Key ID: 0x43E5755D ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From schoenfeld at in-medias-res.com Wed Sep 7 16:25:03 2005 From: schoenfeld at in-medias-res.com (=?ISO-8859-15?Q?sch=F6nfeld_/_in-medias-res?=) Date: Wed, 07 Sep 2005 16:25:03 +0200 Subject: log monitoring? In-Reply-To: <49bd602560f771a4717a7ca957337607@richard-group.com> References: <49bd602560f771a4717a7ca957337607@richard-group.com> Message-ID: <431EF83F.20702@in-medias-res.com> Hi, i'm having the same problem as you, but no solution yet. Please let me know if you find something. Greets Patrick Kurt Yoder schrieb: > Hello list > > Has anyone tried using nagios to monitor log files? I'm thinking > specifically about the types of things that programs like logcheck > report. I looked around on Google, but didn't see anything specifically > mentioning this type of monitoring. > > -- > Kurt Yoder > > > > ------------------------------------------------------- > SF.Net email is Sponsored by the Better Software Conference & EXPO > September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices > Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA > Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. ::: Messages without supporting info will risk > being sent to /dev/null ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From morel.mosolff at native-instruments.de Wed Sep 7 16:24:54 2005 From: morel.mosolff at native-instruments.de (Morel Mosolff) Date: Wed, 7 Sep 2005 16:24:54 +0200 (CEST) Subject: Ethernet interface selection In-Reply-To: <1126102999.747.9.camel@localhost> References: <1126102999.747.9.camel@localhost> Message-ID: <20050907142454.17F2942672F@mail.bln.native-instruments.de> Hello, please note that I am on vacation. best, morel -- -> Morel Mosolff -> Network-/System-Technician -> NATIVE INSTRUMENTS GmbH -> morel.mosolff at native-instruments.de -> Schlesische Strasse 28 -> http://www.native-instruments.de/ -> D-10997 Berlin -> Tel. +49-30-61 10 35-83 -> Germany -> Fax +49-30-61 10 35-35 ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From richard.schneidermayer at arz.co.at Wed Sep 7 16:25:59 2005 From: richard.schneidermayer at arz.co.at (richard.schneidermayer at arz.co.at) Date: Wed, 7 Sep 2005 16:25:59 +0200 Subject: Richard Schneidermayer/ARZ/AT ist =?ISO-8859-1?Q?au=DFer_Haus=2E?= Message-ID: Ich werde ab 22.08.2005 nicht im B?ro sein. Ich kehre zur?ck am 09.09.2005. Ich werde Ihre Nachricht nach meiner R?ckkehr beantworten. ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Wed Sep 7 16:33:38 2005 From: marc at ena.com (Marc Powell) Date: Wed, 7 Sep 2005 09:33:38 -0500 Subject: Always a WARNING state from check_snmp Message-ID: > -----Original Message----- > From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users- > admin at lists.sourceforge.net] On Behalf Of Marco Borsani > Sent: Wednesday, September 07, 2005 8:41 AM > To: NAGIOS > Subject: [Nagios-users] Always a WARNING state from check_snmp > Importance: High > > > Hi all! > > I am testing check_snmp on a Cisco FW. > All it seems to be OK when I checking Memory or Connections, but when I > checking CPU (for istance cpmCPUTotal5sec) I always receive a WARNING > state. > > You can see (in the follwing line command) that I don't mention any > warning/critial range (!) > > #> ./check_snmp -H HOSTADDRESS -o .1.3.6.1.4.1.9.9.109.1.1.1.1.3.1 -C > public > #> SNMP WARNING - 9 Since SNMP data can be anything, and it doesn't have any idea what you're checking based on the OID, how is the plugin supposed to know whether that result is good or not unless you provide warning and critical ranges? IMHO the plugin behavior is appropriate in this case. As for it working for your memory test, there are generic default values defined in check_snmp.c (powers of 2) that your memory and connection tests apparently fall within. -- Marc ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Wed Sep 7 16:52:37 2005 From: marc at ena.com (Marc Powell) Date: Wed, 7 Sep 2005 09:52:37 -0500 Subject: log monitoring? Message-ID: > -----Original Message----- > From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users- > admin at lists.sourceforge.net] On Behalf Of Kurt Yoder > Sent: Wednesday, September 07, 2005 9:13 AM > To: nagios-users at lists.sourceforge.net > Subject: [Nagios-users] log monitoring? > > Hello list > > Has anyone tried using nagios to monitor log files? I'm thinking > specifically about the types of things that programs like logcheck > report. I looked around on Google, but didn't see anything specifically > mentioning this type of monitoring. http://thread.gmane.org/gmane.network.nagios.user/26352 looks to touch on two of the most common methods -- check_log2.pl and sec+nsca. -- Marc ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Wed Sep 7 17:00:14 2005 From: marc at ena.com (Marc Powell) Date: Wed, 7 Sep 2005 10:00:14 -0500 Subject: Ethernet interface selection Message-ID: > -----Original Message----- > From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users- > admin at lists.sourceforge.net] On Behalf Of Cole Tuininga > Sent: Wednesday, September 07, 2005 9:23 AM > To: nagios-users at lists.sourceforge.net > Subject: [Nagios-users] Ethernet interface selection > > > Hi all - I have a question that will probably have a quick answer either > way. > > I have a system with two public ethernet interfaces running Nagios 1.2. > Is there a simple way to make sure that all checks for external systems > go through one particular interface? > > Thanks in advance... This has been discussed numerous times on the list. The synopsis is that your operating system controls which interface to use based on the routes you have created to the destination networks. -- Marc ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From m.borsani at it.net Wed Sep 7 17:05:48 2005 From: m.borsani at it.net (Marco Borsani) Date: Wed, 7 Sep 2005 17:05:48 +0200 Subject: R: Always a WARNING state from check_snmp In-Reply-To: References: Message-ID: Check_snmp does not know how to interpretate the results, but if I don't insert any ranges it should give me an OK state (like it did for memory and connections). check_snmp gives me ALWAYS warning , whichever results it receives! Marco -}Since SNMP data can be anything, and it doesn't have any idea what -}you're checking based on the OID, how is the plugin supposed to know -}whether that result is good or not unless you provide warning and -}critical ranges? IMHO the plugin behavior is appropriate in this case. -} -}As for it working for your memory test, there are generic default values -}defined in check_snmp.c (powers of 2) that your memory and connection -}tests apparently fall within. -} -}-- -}Marc -} -} -}------------------------------------------------------- -}SF.Net email is Sponsored by the Better Software Conference & EXPO -}September 19-22, 2005 * San Francisco, CA * Development Lifecycle -}Practices -}Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA -}Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf -}_______________________________________________ -}Nagios-users mailing list -}Nagios-users at lists.sourceforge.net -}https://lists.sourceforge.net/lists/listinfo/nagios-users -}::: Please include Nagios version, plugin version (-v) and OS -}when reporting any issue. -}::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Wed Sep 7 17:11:45 2005 From: marc at ena.com (Marc Powell) Date: Wed, 7 Sep 2005 10:11:45 -0500 Subject: Not able to get mail notification , Guidance requested Message-ID: > -----Original Message----- > From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users- > admin at lists.sourceforge.net] On Behalf Of John Joseph > Sent: Wednesday, September 07, 2005 12:36 AM > To: Nagios User > Subject: [Nagios-users] Not able to get mail notification , Guidance > requested > > Hi team > I have RHEL4 , nagios 1.2 , postfix , in which MTA is > working fine > I had made contact , contact group , on contact > details , I had given "host-notify-by-email and > notify-by-email " for host_notification_ command and > service_notification_command > and for that host > > > I have defined host " joseph" as > > Host "joseph" > Name : joseph > Alias : Joseph-TEST > Address : 192.168.20.99 > # Parents : Router-HO > # Host Groups : HO-Server > Check_command : > Max_check_attempts : 3 > Checks_enabled : Yes > Event_handler_enabled : Nothing > Event_handler : > Low_flap_threshold : 0 % > High_flap_threshold : 0 % > Flap_detection_enabled : Nothing > Process_perf_data : Nothing > Retain_status_information : Yes > Retain_nonstatus_information : Yes > Notification_interval : 3 * 60 sec > Notification_period : 24x7 > Notification_options : d,u,r > Notifications_enabled : Yes > Stalking_options : o,d,u > Status : Enabled > > For Services > I have defined FTP as > > Host name : joseph > Description : FTP Check > Is Volatile : Nothing > # Service Groups : ForServieGroup > Check_command : check_ftp > Check_command_arguments : > Max_check_attempts : 3 > Normal_check_interval : 3 * 60 sec > Retry_check_interval : 3 * 60 sec > Active_checks_enabled : Nothing > Passive_checks_enabled : Nothing > Check_period : 24x7 > Parallelize_check : Nothing > Obsess_over_service : Nothing > Check_freshness : Nothing > Freshness treshold : 0 sec > Event_handler : > Event_handler_arguments : > Event_handler enabled : Nothing > Low flap treshold : 0 % > High flap treshold : 0 % > Flap_detection_enabled : Nothing > Process_perf_data : Nothing > Retain_status_information : Nothing > Retain_nonstatus_information : Nothing > Notification_interval : 3 * 60 sec > Notification_period : 24x7 > Notification_options : w,u,c,r > Notification_enabled : Yes > # Contact Groups : IT-Support > Stalking_options : o,w,u,c > Status : Enabled > > > I am not able to get the mail notification , when > there is change for the ftp ie when FTP is stopped or > start The above information indicates that you don't have active or passive checks enabled for the service. Is that the case? If so, you're not checking the service so no notifications will ever go out. If you are checking the service and the information above is incorrect, check nagios.log for a notification attempt. Check your postfix logs. Verify that you can send a notification by issuing your notification commands exactly as they are defined as the nagios user (not root!) - post the test here. If you still have problems, please post the exact host and service definitions as well as your notification commands to this list - the information above is not them. Nagios.log entries around the time that the notification should happen would be useful as well. -- Marc ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Wed Sep 7 17:14:23 2005 From: marc at ena.com (Marc Powell) Date: Wed, 7 Sep 2005 10:14:23 -0500 Subject: Status Info = Warning, but Status = Unknown Message-ID: Thanks for posting the followup =) -- Marc > -----Original Message----- > From: John Christian [mailto:potus98 at yahoo.com] > Sent: Tuesday, September 06, 2005 10:11 PM > To: Marc Powell; nagios-users at lists.sourceforge.net > Subject: RE: [Nagios-users] Status Info = Warning, but Status = Unknown > > For posterior's sake :-) > > On the monitored hosts where I was having problems, I > have upgraded SSH from Sun's SSH to OpenSSH. This > seems to have fixed the problem I was having. > > I ran some cursory tests of the scripts against two > hosts (one using Sun's SSH and the other using > OpenSSH). Checking the exit statuses did not provide > any clues. I did not test extensively since upgrading > to OpenSSH (which I'm doing on all hosts anyways) > solved my problem. > > I was not using check_by_ssh. These scripts seem to > handle the ssh-connectivity on their own. > > HTH a future archive searcher! > -John > > --- Marc Powell wrote: > > > > > > > -----Original Message----- > > > From: nagios-users-admin at lists.sourceforge.net > > [mailto:nagios-users- > > > admin at lists.sourceforge.net] On Behalf Of John > > Christian > > > Sent: Tuesday, September 06, 2005 4:09 PM > > > To: nagios-users at lists.sourceforge.net > > > Subject: [Nagios-users] Status Info = Warning, but > > Status = Unknown > > > > > > Hello, > > > > > > I'm looking at the "Service Detail" section on the > > web > > > GUI. Most hosts/services are working fine, but a > > few > > > services seem confused. Their Status Information > > (7th > > > column) is "WARNING [details...]" or "OK > > [details...]" > > > but their Status (3rd column) is still listed as > > > "UNKNOWN". > > > > Nagios determines the Status (3rd column) from the > > plugin exit code > > _only_. The Status Information is just an > > explanation for human > > readability. It would appear that the plugins you > > are using are exiting > > with a different status code than the plugin output > > would suggest they > > should be. > > > > > The problem *seems* to be more common when using > > Jeff > > > Scott's check_load_remote 1.1 or > > check_uptime_remote > > > 1.1 in combination with the remote host running > > > Sun_SSH_1.0.1. Sometimes they will clear-up > > (display > > > statuseses that make sense) but usually they're in > > > status UNKNOWN even though the status information > > > shows WARNING or OK. > > > > > > The same scripts used against hosts running > > > OpenSSH_4.1 are always fine. > > > > Is this a hint that you're executing these plugins > > via check_by_ssh? > > Perhaps check_by_ssh isn't passing the exit code > > back properly or more > > likely it's encountering problems itself and you're > > seeing it's exit > > code. Have you tried running the commands by hand as > > the nagios user > > exactly as they are defined from your central > > machine and verify the > > exit code (echo $?)? > > > > > > > > Why is Nagios displaying conflicting information? > > > > The plugin exit code (very important) doesn't agree > > with the plugin > > output (not important, at least to nagios). The > > 'plugin' in your case > > could be check_load_remote or check_by_ssh if that's > > what you're using. > > > > > How do I force Nagios to 'forget' everything about > > a > > > host and start fresh? > > > > If you're not using state retention, restart nagios. > > If you're using > > state retention, stop nagios, remove the retention > > file and restart > > nagios. > > > > > Other tips? > > > > Check your sshd log on the remote host for errors. > > Enable sshd debug > > mode. > > > > -- > > Marc > > > > > > > ------------------------------------------------------- > > SF.Net email is Sponsored by the Better Software > > Conference & EXPO > > September 19-22, 2005 * San Francisco, CA * > > Development Lifecycle Practices > > Agile & Plan-Driven Development * Managing Projects > > & Teams * Testing & QA > > Security * Process Improvement & Measurement * > > http://www.sqe.com/bsce5sf > > _______________________________________________ > > Nagios-users mailing list > > Nagios-users at lists.sourceforge.net > > > https://lists.sourceforge.net/lists/listinfo/nagios-users > > ::: Please include Nagios version, plugin version > > (-v) and OS when reporting any issue. > > ::: Messages without supporting info will risk being > > sent to /dev/null > > > > > > > > ______________________________________________________ > Click here to donate to the Hurricane Katrina relief effort. > http://store.yahoo.com/redcross-donate3/ ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Leonard.Miller at baesystems.com Wed Sep 7 17:26:58 2005 From: Leonard.Miller at baesystems.com (Leonard Miller) Date: Wed, 07 Sep 2005 10:26:58 -0500 Subject: Service Group Question Message-ID: Hi, I've been using Nagios for a few weeks now and like it a lot, but now I am starting to get into more depth with it. Which brings me to my question regarding service groups. If I setup a service define service{ use generic-service host_name HOSTNAME1 service_description Uptime is_volatile 0 check_period 24x7 max_check_attempts 3 normal_check_interval 2 retry_check_interval 1 contact_groups wireless-admins notification_interval 240 notification_period 24x7 notification_options w,u,c,r check_command check_snmp_uptime!$USER5$ } then setup the service group to check multiple hosts define servicegroup{ servicegroup_name wireless_radios alias Wireless Uptime members HOSTNAME1,UPTIME HOSTNAME2,UPTIME HOSTNAME3,UPTIME } What is to keep Nagios from doing a redundant check of HOSTNAME1? Maybe I'm thinking too hard. Thanks in advance Leonard ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jsmforum at optonline.net Wed Sep 7 17:45:36 2005 From: jsmforum at optonline.net (Jeff) Date: Wed, 7 Sep 2005 11:45:36 -0400 Subject: Perl plugin exit code? In-Reply-To: References: Message-ID: ALL, Thanks for the suggestions. I did manage to get it to work. I had to use strict and declare all my variables. Thanks, Jeff > -----Original Message----- > From: nagios-users-admin at lists.sourceforge.net > [mailto:nagios-users-admin at lists.sourceforge.net] On Behalf > Of Andreas Ericsson > Sent: Tuesday, September 06, 2005 18:03 > To: nagios-users at lists.sourceforge.net > Subject: Re: [Nagios-users] Perl plugin exit code? > > > Jeff wrote: > > Hey all, > > > > I've written a perl script to check the status of some information > > from one of our application databases. It seems to run > fine manually > > but nagios only get's (No Output!) when I try to use it live. > > > > Permissions on the perl script are set correctly, if I su - over to > > nagios I can run the script as nagios. So I'm stumped and > looking for > > ideas here. > > > > Are you running Nagios with the embedded perl interpreter > turned on? If > so, try re-compiling without it, or make sure the mini-epn > can execute > the script. This can (in part) be assured by making it run without > warnings with the strict pragma and the -wT options to the > hashbang line. > > > Second part of my question has to do with perl more than nagios, > > Obviously I need to send the correct exit code to nagios. I set a > > varialble "$status" (0, 1 or 2) in the script according to > the query > > results and at the end of the script I have > > > > Exit $status; > > > > Will this give the correct exit status that nagios is looking for? > > > > Yes. > > -- > Andreas Ericsson andreas.ericsson at op5.se > OP5 AB www.op5.se > Lead Developer > > > ------------------------------------------------------- > SF.Net email is Sponsored by the Better Software Conference & > EXPO September 19-22, 2005 * San Francisco, CA * Development > Lifecycle Practices Agile & Plan-Driven Development * > Managing Projects & Teams * Testing & QA Security * Process > Improvement & Measurement * http://www.sqe.com/bsce5sf > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS > when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From chandresh.suthar at gmail.com Wed Sep 7 18:22:14 2005 From: chandresh.suthar at gmail.com (Chandresh Suthar) Date: Wed, 7 Sep 2005 21:52:14 +0530 Subject: please help me Message-ID: Hi, Please help me in sending notifications. I have only one configuration file for contactfgroup,hostsgroup etc. "minimal.cfg" minimal.cfg : define timeperiod{ timeperiod_name 24x7 alias 24 Hours A Day, 7 Days A Week sunday 00:00-24:00 monday 00:00-24:00 tuesday 00:00-24:00 wednesday 00:00-24:00 thursday 00:00-24:00 friday 00:00-24:00 saturday 00:00-24:00 } define command{ command_name notify-by-email command_line /usr/bin/printf "%b" "***** Nagios *****\n\nNotification Type: $NOTIFICATIONTYPE$\n\nService: $SERVICEDESC$\nHost: $HOSTALIAS$\nAddress: $HOSTADDRESS$\nState: $SERVICESTATE$\n\nDate/Time: $LONGDATETIME$\n\nAdditional Info:\n\n$OUTPUT$" | /bin/mail -s "** $NOTIFICATIONTYPE$ alert - $HOSTALIAS$/$SERVICEDESC$ is $SERVICESTATE$ **" $CONTACTEMAIL$ } # This is a sample host notification command that can be used to send email # notifications (about host alerts) to contacts. define command{ command_name host-notify-by-email command_line /usr/bin/printf "%b" "***** Nagios *****\n\nNotification Type: $NOTIFICATIONTYPE$\nHost: $HOSTNAME$\nState: $HOSTSTATE$\nAddress: $HOSTADDRESS$\nInfo: $OUTPUT$\n\nDate/Time: $LONGDATETIME$\n" | /bin/mail -s "Host $HOSTSTATE$ alert for $HOSTNAME$!" $CONTACTEMAIL$ } define contact{ contact_name nagios alias Nagios Admin service_notification_period 24x7 host_notification_period 24x7 service_notification_options w,u,c,r host_notification_options d,r service_notification_commands notify-by-email host_notification_commands host-notify-by-email email nagios at localhost } define contact{ contact_name chandresh alias Administrator service_notification_period 24x7 host_notification_period 24x7 service_notification_options w,u,c,r host_notification_options d,r service_notification_commands notify-by-email host_notification_commands host-notify-by-email email "personalemail id" } define contactgroup{ contactgroup_name admins alias Nagios Administrators members nagios } define contactgroup{ contactgroup_name admingrp alias Administrators members chandresh } define host{ name generic-host ; The name of this host template notifications_enabled 1 ; Host notifications are enabled event_handler_enabled 1 ; Host event handler is enabled flap_detection_enabled 1 ; Flap detection is enabled failure_prediction_enabled 1 ; Failure prediction is enabled process_perf_data 1 ; Process performance data retain_status_information 1 ; Retain status information across program restarts retain_nonstatus_information 1 ; Retain non-status information across program restarts register 0 ; DONT REGISTER THIS DEFINITION - ITS NOT A REAL HOST, JUST A TEMPLATE! } define host{ use generic-host ; Name of host template to use host_name webmail.com alias localhost address 127.0.0.1 check_command check-host-alive max_check_attempts 10 notification_interval 120 notification_period 24x7 notification_options d,r contact_groups admingrp } define service{ use generic-service ; Name of service template to use host_name webmail.com service_description HTTP is_volatile 0 check_period 24x7 max_check_attempts 4 normal_check_interval 5 retry_check_interval 1 contact_groups admingrp notification_interval 960 notification_period 24x7 check_command check_http } I am able to send mails to my persoanl email id by "mail" command. But notification is not going. I have configured sendmail. I am not even receiving mails on local system at nagios at localhost( I tried). I am even not getting any error messages in logs of nagios. Please help. -------------- next part -------------- An HTML attachment was scrubbed... URL: From marc at ena.com Wed Sep 7 18:27:27 2005 From: marc at ena.com (Marc Powell) Date: Wed, 7 Sep 2005 11:27:27 -0500 Subject: oscp_command never runs Message-ID: > -----Original Message----- > From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users- > admin at lists.sourceforge.net] On Behalf Of Ludwig Pummer > Sent: Tuesday, September 06, 2005 5:30 PM > To: nagios-users at lists.sourceforge.net > Subject: [Nagios-users] oscp_command never runs > > Let me start off by saying I've already searched the mailing list for this > issue and checked for issues mentioned in those messages. Of course, it's > possible that I didn't find (and therefore didn't check for) the issue > which is causing my problem... > > I'm running Nagios 1.2 compiled from source package on FreeBSD. I'm > experimenting with setting up distributed + failover monitoring, so my > configuration is fairly small and clean. Right now, I'm trying to make > 'boba' send its service check results to 'jango' using the ocsp_command. > > My nagios.cfg has: > ocsp_command=submit_check_result_smart > ocsp_timeout=5 > obsess_over_services=1 > use_retained_program_state=0 > I've also turned on all logging options except the initial state option. > > My services.cfg template for all of the services has: > obsess_over_service 1 > > my misccommands.cfg has: > define command{ > command_name submit_check_result_smart > command_line > /usr/local/nagios/libexec/eventhandlers/submit_check_result_smart > $HOSTNAME$ '$SERVICEDESC$' $SERVICESTATE$ '$OUTPUT$'` > } > > I have 4 hosts and 6 services configured (1 PING on each host, and NRPE > check_nagios on 2 of the 4 hosts) and they've been running fine. > > My event handler does a lookup on the hostname passed to it and then calls > send_nsca. It also writes a line to its own log file with the arguments > passed to it. > > If I manually run this event handler as the nagios user: > root at boba# su -l nagios > /usr/local/nagios/libexec/eventhandlers/submit_check_result_smart jango > 'PING' OK 'test ping time 0.4ms' > 1 data packet(s) sent to host successfully. > root at boba# > > My event handler's log after the above manual run: > submit_check_result_smart called with 1:jango 2:PING 3:OK 4:test ping time > 0.4ms, found;1, return_code:0 > > On host jango, in nagios.log, I see: > EXTERNAL COMMAND: PROCESS_SERVICE_CHECK_RESULT;jango;PING;0;test ping time > 0.4ms > > Nagios doesn't find any errors in my config (just 1 warning about a > contact not belonging to a contact group), and I've stopped it, removed > the status.sav file, and restarted it. The CGI interface's Process Info > page on boba says that Obessing is enabled. I see nothing in my nagios.log > about the ocsp_command being run, and my submit_check_result_smart > script's log file never shows that the command was run. > > Anyone have any ideas why the ocsp_command is not being executed? Not really. Everything looks correct as detailed above. My next step would be to simplify the submit_check_result_smart command to at least give a direction to look -- define command{ command_name submit_check_result_smart command_line /bin/echo "I ran for '$SERVICEDESC$' on $HOSTNAME$" > /tmp/ocsp } Make sure that nagios really is restarting as well. Nagios doesn't normally log OSCP execution but you may be able to see those by recompiling with a higher debug level. There haven't been any problems with the OCSP code for as long as I can remember so you probably don't need to do that. -- Marc ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Leonard.Miller at baesystems.com Wed Sep 7 18:45:01 2005 From: Leonard.Miller at baesystems.com (Leonard Miller) Date: Wed, 07 Sep 2005 11:45:01 -0500 Subject: Service Group Question Message-ID: I think I get it. I use Hostgroup_name instead of Host_name. Right? >>> "Leonard Miller" 9/7/2005 11:26:58 AM >>> Hi, I've been using Nagios for a few weeks now and like it a lot, but now I am starting to get into more depth with it. Which brings me to my question regarding service groups. If I setup a service define service{ use generic-service host_name HOSTNAME1 service_description Uptime is_volatile 0 check_period 24x7 max_check_attempts 3 normal_check_interval 2 retry_check_interval 1 contact_groups wireless-admins notification_interval 240 notification_period 24x7 notification_options w,u,c,r check_command check_snmp_uptime!$USER5$ } then setup the service group to check multiple hosts define servicegroup{ servicegroup_name wireless_radios alias Wireless Uptime members HOSTNAME1,UPTIME HOSTNAME2,UPTIME HOSTNAME3,UPTIME } What is to keep Nagios from doing a redundant check of HOSTNAME1? Maybe I'm thinking too hard. Thanks in advance Leonard \ ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Wed Sep 7 18:45:45 2005 From: marc at ena.com (Marc Powell) Date: Wed, 7 Sep 2005 11:45:45 -0500 Subject: please help me Message-ID: > -----Original Message----- > From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users- > admin at lists.sourceforge.net] On Behalf Of Chandresh Suthar > Sent: Wednesday, September 07, 2005 11:22 AM > To: nagios-users at lists.sourceforge.net > Subject: [Nagios-users] please help me > > Hi, > > Please help me in sending notifications. I have only one configuration > file for contactfgroup,hostsgroup etc. "minimal.cfg" > [good stuff removed] > define command{ > command_name notify-by-email > command_line /usr/bin/printf "%b" "***** Nagios > *****\n\nNotification Type: $NOTIFICATIONTYPE$\n\nService: > $SERVICEDESC$\nHost: $HOSTALIAS$\nAddress: $HOSTADDRESS$\nState: > $SERVICESTATE$\n\nDate/Time: $LONGDATETIME$\n\nAdditional > Info:\n\n$OUTPUT$" | /bin/mail -s "** $NOTIFICATIONTYPE$ alert - > $HOSTALIAS$/$SERVICEDESC$ is $SERVICESTATE$ **" $CONTACTEMAIL$ > } > > Have you tried executing this exactly as defined above (with substitutions of course) as the nagios user? [good stuff removed] > > define contact{ > contact_name chandresh > alias Administrator > service_notification_period 24x7 > host_notification_period 24x7 > service_notification_options w,u,c,r > host_notification_options d,r > service_notification_commands notify-by-email > host_notification_commands host-notify-by-email > email "personalemail id" > } Quotes aren't necessary but I don't expect they'd be a problem. [good stuff removed] > define service{ > use generic-service ; Name of > service template to use > host_name webmail.com > service_description HTTP > is_volatile 0 > check_period 24x7 > max_check_attempts 4 > normal_check_interval 5 > retry_check_interval 1 > contact_groups admingrp > notification_interval 960 > notification_period 24x7 > check_command check_http > } You do not appear to have any notification_options for this service. Without it nagios will not notify for this service. This is most likely your problem based on the information provided. > I am able to send mails to my persoanl email id by "mail" command. But > notification is not going. I have configured sendmail. I am not even > receiving mails on local system at nagios at localhost > ( I tried). I am even not getting any error > messages in logs of nagios. Do you see a notification attempt in nagios.log? maillog? -- Marc ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From prosolutions at gmx.net Wed Sep 7 19:06:28 2005 From: prosolutions at gmx.net (prosolutions at gmx.net) Date: Wed, 7 Sep 2005 10:06:28 -0700 Subject: Nagios Failover system with state sync - possible? Message-ID: <20050907170628.GE3283@think.alaya.mine.nu> Is it possible to have a nagios setup in which the state information on monitored hosts is continuously synced from master to slave such that in the event of failover to the slave all state data will be retained? The model I have in my head is of the way MySQL maintains real-time replication between a master and slave and can thus handle failovers gracefully via heartbeat. ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Ludwig.Pummer at Copart.Com Wed Sep 7 19:19:18 2005 From: Ludwig.Pummer at Copart.Com (Ludwig Pummer) Date: Wed, 7 Sep 2005 10:19:18 -0700 Subject: oscp_command never runs Message-ID: > -----Original Message----- > From: nagios-users-admin at lists.sourceforge.net > [mailto:nagios-users-admin at lists.sourceforge.net] On Behalf > Of Marc Powell > Sent: Wednesday, September 07, 2005 9:27 AM > To: nagios-users at lists.sourceforge.net > Subject: RE: [Nagios-users] oscp_command never runs > > > > > -----Original Message----- > > From: nagios-users-admin at lists.sourceforge.net > [mailto:nagios-users- > > admin at lists.sourceforge.net] On Behalf Of Ludwig Pummer > > Sent: Tuesday, September 06, 2005 5:30 PM > > To: nagios-users at lists.sourceforge.net > > Subject: [Nagios-users] oscp_command never runs ... > > my misccommands.cfg has: > > define command{ > > command_name submit_check_result_smart > > command_line > > /usr/local/nagios/libexec/eventhandlers/submit_check_result_smart > > $HOSTNAME$ '$SERVICEDESC$' $SERVICESTATE$ '$OUTPUT$'` } ... > > Anyone have any ideas why the ocsp_command is not being executed? > > Not really. Everything looks correct as detailed above. My > next step would be to simplify the submit_check_result_smart > command to at least give a direction to look -- > > define command{ > command_name submit_check_result_smart > command_line /bin/echo "I ran for '$SERVICEDESC$' on > $HOSTNAME$" > /tmp/ocsp > } > > Make sure that nagios really is restarting as well. Nagios > doesn't normally log OSCP execution but you may be able to > see those by recompiling with a higher debug level. There > haven't been any problems with the OCSP code for as long as I > can remember so you probably don't need to do that. > > -- > Marc Your suggested command worked fine, so I went back to figuring out why mine didn't. Configuring with --enable-DEBUG0 --enable-DEBUG1 --enable-DEBUG2 --enable-DEBUG3 --enable-DEBUG4 made for a much noisier Nagios, but at least I was able to debug. FYI, the CGI directory won't compile if you have those debugging messages turned on. I eventually tracked it down to an extra backtick (`) in my command_line above. When I ran the debug nagios in non-daemon mode, sh displayed "Syntax error: EOF in backquote substitution" when my ocsp_command was run. Thanks for your help. ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Praveenm at niit.com Wed Sep 7 19:18:05 2005 From: Praveenm at niit.com (Praveen Muthyala Manohar) Date: Wed, 7 Sep 2005 13:18:05 -0400 Subject: Evaluate a string (GET/POST) Message-ID: Hi All, Using Nagios, can I evaluate a URL to return string using GET and POST methods? Most commercial monitoring tools have this feature. Thanks. ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Praveenm at niit.com Wed Sep 7 19:40:59 2005 From: Praveenm at niit.com (Praveen Muthyala Manohar) Date: Wed, 7 Sep 2005 13:40:59 -0400 Subject: check_oracle Message-ID: Would appreciate if someone can share all possible checkcommand.cfg definitions for check_oracle. Thanks a ton. ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From f1216 at yahoo.com Wed Sep 7 19:52:58 2005 From: f1216 at yahoo.com (Fred) Date: Wed, 7 Sep 2005 10:52:58 -0700 (PDT) Subject: Nagios Failover system with state sync - possible? In-Reply-To: <20050907170628.GE3283@think.alaya.mine.nu> References: <20050907170628.GE3283@think.alaya.mine.nu> Message-ID: <20050907175258.20901.qmail@web31909.mail.mud.yahoo.com> I would bet if you replicate your nagios configuration to another node and have your status.sav file on common highly available storage and the main monitoring node fails, a backup node could start nagios using the same configuration and pick right up more or less ... You don't need replication, but even if you did, you could probably just sync out the status.sav file (assuming you have nagios configured to flush it out in a reasonable interval, the default is 15seconds I think) and then copy the status.sav file to the backup node just in case. -FredC --- prosolutions at gmx.net wrote: > Is it possible to have a nagios setup in which the state information on > monitored hosts is continuously synced from master to slave such that in > the event of failover to the slave all state data will be retained? > > The model I have in my head is of the way MySQL maintains real-time > replication between a master and slave and can thus handle failovers > gracefully via heartbeat. > > > > > > ------------------------------------------------------- > SF.Net email is Sponsored by the Better Software Conference & EXPO > September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices > Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA > Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting > any issue. > ::: Messages without supporting info will risk being sent to /dev/null > ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From f1216 at yahoo.com Wed Sep 7 20:03:35 2005 From: f1216 at yahoo.com (Fred) Date: Wed, 7 Sep 2005 11:03:35 -0700 (PDT) Subject: Service check delays in distributed monitor setup Message-ID: <20050907180335.14632.qmail@web31915.mail.mud.yahoo.com> I think I have found the source of my issue with distributed monitoring and service checks. It turns out that if you enable distributed monitoring, even passive service check definitions seem to get scheduled to run when nagios starts up. If you have say 10350 services (give or take one) and use smart scheduling of services, you could easily see 3+ hours between the time that the first service is scheduled and the last one. Changing the smart schduling to "n" for no delay causes the services to not be scheduled in the future, but by the time nagios processes the entire configuration file, the start time is in the past and I think nagios forgets about the service so it is never scheduled again. I'm currently trying a service_inter_check_delay_method=0.05 which puts me at about 3 minutes for 10,000+ services, which seems to be enough time for nagios to startup and still have its first pending service scheduled in the near future rather then the near past ... Does this make sense to anyone who has been messing with these configuration settings? Is there a better way to do this? I.e., I would like for nagios to *not* consider the passive checks in any scheduling. I actually only have a small number of active checks which when run will populate the rest of the passive checks for the entire cluster, the problem is that it seems the node that I run these checks on is alphabetically *after* all of the other nodes so it seems to be scheduled last and has services starting the furthest out. Thanks. -FredC ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ae at op5.se Wed Sep 7 20:06:11 2005 From: ae at op5.se (Andreas Ericsson) Date: Wed, 07 Sep 2005 20:06:11 +0200 Subject: Evaluate a string (GET/POST) In-Reply-To: References: Message-ID: <431F2C13.2090201@op5.se> Praveen Muthyala Manohar wrote: > Hi All, > > Using Nagios, can I evaluate a URL to return string using GET and POST > methods? > Yes. RTFHO for details. > Most commercial monitoring tools have this feature. > I'd expect no less. It's a no-brainer codewise, so the tools that doesn't should be taken somewhere and shot. -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Lead Developer ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ae at op5.se Wed Sep 7 20:09:11 2005 From: ae at op5.se (Andreas Ericsson) Date: Wed, 07 Sep 2005 20:09:11 +0200 Subject: check_oracle In-Reply-To: References: Message-ID: <431F2CC7.5070907@op5.se> Praveen Muthyala Manohar wrote: > Would appreciate if someone can share all possible checkcommand.cfg > definitions for check_oracle. There's really a near-infinite number of combinations, depending on what options you're using, and what your databases and their tables are named. ./check_oracle --help and a minute or two of pondering should do the trick for you. > Thanks a ton. > You're welcome. -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Lead Developer ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Wed Sep 7 20:29:07 2005 From: marc at ena.com (Marc Powell) Date: Wed, 7 Sep 2005 13:29:07 -0500 Subject: roomity.com spam Message-ID: Is anyone else getting off-list spam from roomity.com? They clearly state they got my address from this list with an 'opt-out' e-mail address. -- Marc ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From tjn at umn.edu Wed Sep 7 20:29:11 2005 From: tjn at umn.edu (Travis J. Noll) Date: Wed, 07 Sep 2005 13:29:11 -0500 Subject: partial service_perfdata_command failure Message-ID: <431F3177.9070203@umn.edu> Thanks in advance for any thoughts on this matter. I've been handed the keys to a generally neglected nagios install and been given marching orders to get load / mem usage graphs up and running. I found a partially configured nagiostat in our code tree and have gotten it to work on some occasions. My problem is that the service_perfdata_command does not seem to be running for all services. I have the lines: # nagiostat service_perfdata_command=service-perf-data-handler but have also reverted to uncommenting: # testing #service_perfdata_command=process-service-perfdata where I was able to see both in /usr/local/nagios/nagiostat/debug.log and /tmp/service-perfdata that not all of the service updates available via the web interface are being "post-processed" One host is handing off disk usage information to the service_perfdata_command but not load or memory services. Other hosts are able to pipe load and memory usage information fine. Another host appears to have no information available in either of the expected log files, though it stays current within the web interface to nagios. Sorry to be such a feeb, but I've only been working on this a couple days and have been unable to find an answer in docs, google, or archives. Please let me know what configuration information I can dig up to be helpful. Thanks again, Travis ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From RLAdams at Kelsey-Seybold.com Wed Sep 7 20:55:07 2005 From: RLAdams at Kelsey-Seybold.com (Adams, Russell L.) Date: Wed, 7 Sep 2005 13:55:07 -0500 Subject: Nagios Failover system with state sync - possible? In-Reply-To: <20050907170628.GE3283@think.alaya.mine.nu> References: <20050907170628.GE3283@think.alaya.mine.nu> Message-ID: <20050907185507.GL18813@pingu.ksnet.com> Heartbeat and DRBD. Russell On Wed, Sep 07, 2005 at 10:06:28AM -0700, prosolutions at gmx.net wrote: > Is it possible to have a nagios setup in which the state information on > monitored hosts is continuously synced from master to slave such that in > the event of failover to the slave all state data will be retained? > > The model I have in my head is of the way MySQL maintains real-time replication between a master and slave and can thus handle failovers gracefully via heartbeat. > > > > > > ------------------------------------------------------- > SF.Net email is Sponsored by the Better Software Conference & EXPO > September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices > Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA > Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nagios-list at dale.us Wed Sep 7 22:35:37 2005 From: nagios-list at dale.us (Dale Blount) Date: Wed, 07 Sep 2005 16:35:37 -0400 Subject: Questions Galore Message-ID: <1126125337.8742.11.camel@dale.velocity.net> Hey Guys, I'm finally finishing up my Nagios installation and have a few questions that I haven't been able to find good answers to. 1) I have one host that has a bad mobo/NIC/driver or something that times out every so often. The box is going away soon so I don't really care to fix it, however almost every time this box, another certain hosts SMTP service goes offline as well. Could this be caused by Nagios' scheduling order (same host always goes down with it) or do I just have a genuine network problem? 2) I have a couple contacts with pager address defined, but not all host groups they belong to should send pages. Should I make this person two contacts - user-email and user-page? Add both of them to hosts groups where they should get paged or is there a cleaner solution. 3) Does anyone have a good way to get alerts to AIM addresses? Everything I can find is incomplete. 4) Every so often I get hosts which are physically unreachable. check_ping however, just times out at the -t X setting instead of reporting unreachable. Any ideas? Thanks for all the help, Dale ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ae at op5.se Wed Sep 7 22:53:32 2005 From: ae at op5.se (Andreas Ericsson) Date: Wed, 07 Sep 2005 22:53:32 +0200 Subject: Questions Galore In-Reply-To: <1126125337.8742.11.camel@dale.velocity.net> References: <1126125337.8742.11.camel@dale.velocity.net> Message-ID: <431F534C.2090305@op5.se> Dale Blount wrote: > Hey Guys, > > > I'm finally finishing up my Nagios installation and have a few questions > that I haven't been able to find good answers to. > > > 1) I have one host that has a bad mobo/NIC/driver or something that > times out every so often. The box is going away soon so I don't really > care to fix it, however almost every time this box, another certain > hosts SMTP service goes offline as well. Could this be caused by > Nagios' scheduling order (same host always goes down with it) or do I > just have a genuine network problem? > Without further investigation it's impossible to say for sure, but it sounds like a network problem. > 2) I have a couple contacts with pager address defined, but not all host > groups they belong to should send pages. Should I make this person two > contacts - user-email and user-page? Add both of them to hosts groups > where they should get paged or is there a cleaner solution. > It's normal to define two contacts, as the two addresses are distrinctly different targets. People tend to associate contacts with people rather than notification targets, which messes up the planning. > 3) Does anyone have a good way to get alerts to AIM addresses? > Everything I can find is incomplete. > www.nagiosexchange.com (or .org, I can't remember) has some stuff on notifications. Otherwise, google is your friend. It should be possible to hack something up with that text-based IM-thingie. Can't remember its name right now, but it supports MSN, ICQ, AIM and most of the other common things. > 4) Every so often I get hosts which are physically unreachable. > check_ping however, just times out at the -t X setting instead of > reporting unreachable. Any ideas? > This happens because a switch or router on the same network still has the MAC-address in its ARP-tables. There's really no solution to this the first time the plugin is run. check_icmp does a better job of detecting it, but will most likely also time out the first time or two it's run, given the low timeout thresholds generally in use with nagios plugins compared to the higher ones used in most networks. > > Thanks for all the help, > > Dale > > > > ------------------------------------------------------- > SF.Net email is Sponsored by the Better Software Conference & EXPO > September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices > Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA > Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Lead Developer ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From misc at viceconsulting.co.nz Wed Sep 7 22:56:50 2005 From: misc at viceconsulting.co.nz (misc at viceconsulting.co.nz) Date: Thu, 8 Sep 2005 08:56:50 +1200 (NZST) Subject: Nagios spawning rogue nagios processes eventually crashing Nagios server Message-ID: <37061.127.0.0.1.1126126610.squirrel@www.goldenfields.co.nz> Hi All, I'm having a bit of a problem with my Central Nagios server - it is leaking memory because it is spawning nagios processes without closing them. Note: information such as OS flavour and version, architecture, compile options etc, are listed at the end of this email. I have 6 Nagios servers, 5 are distributed, and report to the the 6th Nagios server (the Central Nagios server). All Nagios servers are installed from a custom built RPM. Details of the build environment are listed below. Distributed Nagios servers are sending service check results to the Central Nagios server via NSCA. The central Nagios server is running Nagios v2.0b4. I am using NSCA v2.4. The central Nagios server is receiving passive check results from the 5 distributed servers. It is receiving results from 82 hosts and 1300 services. I believe the reason Nagios is leaking memory has something to do with processing the performance data. I am using nagiosgraph v0.4 (nagiosgraph.sf.net) to process performance data. I am using the default processing method (ie nagiosgraph is run everytime a service check result is received by the central Nagios server). As per the documentation, I believe everytime a service check is received, Nagios will spawn a new Nagios instance to run the performance processing command. The problem is, that over time, hundreds of rogue Nagios processes end up running on the Central Nagios server and never closing themselves. Each rogue Nagios process chews up memory, and eventually the machine runs out of memory and swap, rendering the machine unusable (has to be rebooted). Each rogue nagios process is listed as having process 1 as its parent, rather than the master nagios process, which is strange, so it appears to be getting separated from its parent at some point. I believe this may be caused by many processes competing to write to the same file (possibly /var/log/nagios/rw/nagios.cmd), but due to locking or race conditions being unable to and thus remaining running permanently. The performance processing command being run is: /usr/bin/perl /usr/local/nagiosrrd/insert_fast.pl "$LASTSERVICECHECK$||$HOSTNAME$||$SERVICEDESC$||$SERVICEOUTPUT$||$SERV ICEPERFDATA$" The command is being run via Perl itself, I have not compiled in embedded Perl support. (Perl version) # perl -v This is perl, v5.8.0 built for i386-linux-thread-multi (with 1 registered patch, see perl -V for more detail) I have no problems whatsoever if I reduce the number of hosts sending their results to the Central Nagios server. With 8 hosts and about 100 services, there is no memory loss. The CPU usage % is about 10% (5% IO wait, 3% system, 2% user). I also have no problems with memory loss if I disable processing performance data. With 1300 services reporting, this equates to about 4 services per second (check performed every 5 minutes). There is never more than 1 or 2 perl processes running, so Perl is running the performance processing script fine and exiting. With 1300 services reporting, the CPU usage % is at 100% continually (1-3% IO wait, 80-90% system, 10-15% user). The load average isn't too bad: # uptime 19:52:13 up 3:24, 1 user, load average: 2.40, 2.78, 2.57 Sample snippet of the process listing (full process listing below): (Currently after being up 3.5 hours, there are 227 nagios processes, and 131 nsca processes running). ... nagios 27775 1 0 17:05 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 30478 1 0 17:09 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 31653 1 0 17:10 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 465 1 0 17:12 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 834 1 0 17:13 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 1935 1 0 17:14 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 5738 1 0 17:19 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 6068 1 0 17:20 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 7110 1 0 17:21 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 8154 1 0 17:22 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 9233 1 0 17:24 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 9681 1 0 17:24 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 10437 1 0 17:25 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg ... nagios 1248 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1250 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1251 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1252 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1253 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1254 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1255 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1256 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d Running strace on these rogue processes did not allow me to conclusively figure out what was going on: # strace -p 27775 Process 27775 attached - interrupt to quit write(7, "hostname1\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0"..., 512) = 512 write(7, "hostname2\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0"..., 512) = 512 write(7, "hostname3\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0"..., 512) = 512 write(7, "hostname4\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0"..., 512) = 512 write(7, "hostname5\0\0\0\0\0\0\0\0\0\0\0"..., 512) = 512 write(7, "hostname6\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0"..., 512) = 512 write(7, "hostname7\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0"..., 512) = 512 write(7, "hostname8\0\0\0\0\0\0\0\0\0\0"..., 512) = 512 ... exit_group(0) = ? >From 5-100 hostnames listed on every strace I try. It seems strange, it basically writes a list of some of the hosts it is monitoring. Running strace on a rogue process once or sometimes twice, seems to "revive" it and then it exits on its own accord. Note: the actual hostnames have been replaced with "hostnameX". The actual hostnames are not in any particular order in the strace output. They are not in alphabetical order. # strace -p 23454 Process 23454 attached - interrupt to quit write(7, "hostname41\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0"..., 512) = 512 write(7, "hostname42\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0"..., 512) = 512 write(7, "hostname43\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0"..., 512) = 512 write(7, "hostname44\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0"..., 512) = 512 write(7, "hostname45\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0"..., 512) = 512 write(7, "hostname46\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0"..., 512) = 512 exit_group(0) = ? Process 23454 detached Running strace on some of the nsca processes is possibly more enlightening: # ps -ef | grep 1280 nagios 1280 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d # strace -p 1280 Process 1280 attached - interrupt to quit open("/var/log/nagios/rw/nagios.cmd", O_WRONLY|O_CREAT|O_TRUNC, 0666) = 3 fstat64(3, {st_mode=S_IFIFO|0660, st_size=0, ...}) = 0 mmap2(NULL, 4096, PROT_READ|PROT_WRITE, MAP_PRIVATE|MAP_ANONYMOUS, - 1, 0) = 0xb75f6000 write(3, "[1126067362] PROCESS_SERVICE_CHE"..., 138) = 138 close(3) = 0 munmap(0xb75f6000, 4096) = 0 time([1126080660]) = 1126080660 recv(5, "", 720, 0) = 0 munlock(0x8208358, 56) = 0 munlock(0x8208398, 4168) = 0 munlock(0x8207148, 12) = 0 close(5) = 0 exit_group(0) = ? Process 1280 detached As per the nagios processes, running strace on the process seems to "revive" the process and it exits itself on its own accord. NSCA was also installed from a custom built RPM (built on the same environment as the Nagios RPM). NSCA is using blowfish encryption. lsof output of a rogue nagios process: # /usr/sbin/lsof -p 4775 COMMAND PID USER FD TYPE DEVICE SIZE NODE NAME nagios 4775 nagios cwd DIR 8,7 1024 2 / nagios 4775 nagios rtd DIR 8,7 1024 2 / nagios 4775 nagios txt REG 8,6 402680 39127 /usr/bin/nagios nagios 4775 nagios mem REG 8,7 1571824 81602 /lib/tls/libc- 2.3.2.so nagios 4775 nagios mem REG 8,7 1792244 42919 /lib/libnss_ldap- 2.3.2.so nagios 4775 nagios mem REG 8,7 76540 42887 /lib/libresolv- 2.3.2.so nagios 4775 nagios mem REG 8,6 10912 6201 /usr/lib/sasl/libcrammd5.so.1.0.19 nagios 4775 nagios mem REG 8,6 30724 6205 /usr/lib/sasl/libdigestmd5.so.0.0.20 nagios 4775 nagios mem REG 8,6 385252 93994 /usr/kerberos/lib/libkrb5.so.3.1 nagios 4775 nagios mem REG 8,6 13004 6256 /usr/lib/sasl/libgssapiv2.so.1.0.19 nagios 4775 nagios mem REG 8,6 76712 93933 /usr/kerberos/lib/libgssapi_krb5.so.2.2 nagios 4775 nagios mem REG 8,6 11844 93939 /usr/kerberos/lib/libdes425.so.3.0 nagios 4775 nagios mem REG 8,6 5900 6188 /usr/lib/sasl/libanonymous.so.1.0.17 nagios 4775 nagios mem REG 8,6 8144 6264 /usr/lib/sasl/liblogin.so.0.0.7 nagios 4775 nagios mem REG 8,7 30488 42910 /lib/libpam.so.0.75 nagios 4775 nagios mem REG 8,7 51952 42875 /lib/libnss_files-2.3.2.so nagios 4775 nagios mem REG 8,7 23388 42853 /lib/libcrypt- 2.3.2.so nagios 4775 nagios mem REG 8,6 72552 93952 /usr/kerberos/lib/libk5crypto.so.3.0 nagios 4775 nagios mem REG 8,6 22808 49102 /usr/lib/libltdl.so.3.1.0 nagios 4775 nagios mem REG 8,7 97712 81611 /lib/tls/libpthread-0.60.so nagios 4775 nagios mem REG 8,7 8548 42907 /lib/liblaus.so.1.0.0 nagios 4775 nagios mem REG 8,7 18632 42872 /lib/libnss_dns- 2.3.2.so nagios 4775 nagios mem REG 8,7 14868 42855 /lib/libdl- 2.3.2.so nagios 4775 nagios mem REG 8,6 7896 6268 /usr/lib/sasl/libplain.so.1.0.16 nagios 4775 nagios mem REG 8,7 213508 81606 /lib/tls/libm- 2.3.2.so nagios 4775 nagios mem REG 8,7 106912 42842 /lib/ld-2.3.2.so nagios 4775 nagios mem REG 8,6 5540 93936 /usr/kerberos/lib/libcom_err.so.3.0 nagios 4775 nagios 0r CHR 1,3 9675 /dev/null nagios 4775 nagios 1w CHR 1,3 9675 /dev/null nagios 4775 nagios 2w CHR 1,3 9675 /dev/null nagios 4775 nagios 3u IPv4 2139 TCP central-nagios- server:32789->ldap-server:ldap (ESTABLISHED) nagios 4775 nagios 4u REG 8,2 5 76244 /var/run/nagios.pid nagios 4775 nagios 5r FIFO 8,17 97538 /var/log/nagios/rw/nagios.cmd nagios 4775 nagios 7w FIFO 0,5 2206 pipe lsof output of an nsca process: # /usr/sbin/lsof -p 1263 COMMAND PID USER FD TYPE DEVICE SIZE NODE NAME nsca 1263 nagios cwd DIR 8,7 1024 2 / nsca 1263 nagios rtd DIR 8,7 1024 2 / nsca 1263 nagios txt REG 8,6 34684 26692 /usr/sbin/nsca nsca 1263 nagios mem REG 8,6 188392 49144 /usr/lib/libmcrypt.so.4.4.7 nsca 1263 nagios mem REG 8,7 51952 42875 /lib/libnss_files-2.3.2.so nsca 1263 nagios mem REG 8,7 1571824 81602 /lib/tls/libc-2.3.2.so nsca 1263 nagios mem REG 8,7 91040 42859 /lib/libnsl- 2.3.2.so nsca 1263 nagios mem REG 8,7 106912 42842 /lib/ld- 2.3.2.so nsca 1263 nagios 0r CHR 1,3 9675 /dev/null nsca 1263 nagios 1w CHR 1,3 9675 /dev/null nsca 1263 nagios 2w CHR 1,3 9675 /dev/null nsca 1263 nagios 4u unix 0xf7a10180 1539 socket nsca 1263 nagios 5u IPv4 2002 TCP central- nagios-server:5667->192.168.148.16:41799 (CLOSE_WAIT) ldd output: # ldd /usr/bin/nagios libm.so.6 => /lib/tls/libm.so.6 (0x00116000) libpthread.so.0 => /lib/tls/libpthread.so.0 (0x00f57000) libltdl.so.3 => /usr/lib/libltdl.so.3 (0x00db9000) libc.so.6 => /lib/tls/libc.so.6 (0x00138000) /lib/ld-linux.so.2 => /lib/ld-linux.so.2 (0x00696000) libdl.so.2 => /lib/libdl.so.2 (0x00e0e000) # ldd /usr/sbin/nsca libmcrypt.so.4 => /usr/lib/libmcrypt.so.4 (0x00666000) libnsl.so.1 => /lib/libnsl.so.1 (0x00dcd000) libc.so.6 => /lib/tls/libc.so.6 (0x00aff000) /lib/ld-linux.so.2 => /lib/ld-linux.so.2 (0x00434000) Inspecting the number of filehandles in use, I don't believe that is a problem: # cat /proc/sys/fs/file-nr 1277 121 104857 Contents of /var/log/nagios: drwxr-xr-x 3 nagios nagios 4.0K Sep 7 00:00 archives -rw-rw-r-- 1 nagios nagios 240 Aug 29 10:28 comments.dat -rw-rw-r-- 1 nagios nagios 241 Aug 29 10:28 downtime.dat drwx------ 2 root root 16K Jun 15 08:33 lost+found -rw-rw-r-- 1 nagios nagios 46M Sep 7 20:44 nagios.log -rw-r--r-- 1 nagios nagios 11M Sep 7 20:44 nagiosrrd.log -rw-r--r-- 1 nagios nagios 581K Sep 7 16:29 nsca.dump -rw-r--r-- 1 nagios nagios 1.0M Sep 7 16:29 objects.cache -rw------- 1 nagios nagios 1.6M Sep 7 20:29 retention.dat drwxrwxr-x 8 nagios nagios 4.0K Jul 19 14:49 rrd drwxr-sr-x 2 nagios apache 4.0K Sep 7 16:29 rw -rw-rw-r-- 1 nagios nagios 1.5M Sep 7 20:44 status.dat # ls -lh rw/ total 0 prw-rw---- 1 nagios apache 0 Sep 7 20:45 nagios.cmd I would be much obliged if anyone had any suggestions as to what is going on here, and possible suggestions to resolve this problem. If there is any more information or data you need me to provide to troubleshoot, please let me know and I will supply the relevant information. Cheers, Alex. ---- Central Nagios server details (Build environment details follow) ---- Architecture: x86, Intel Xeon 2.66GHz VMware guest (VMware esx 2.1.2), 1G ram RedHat ES 3 Update 5 Configure options: %configure \ --datadir="%{_datadir}/nagios" \ --libexecdir="%{_libdir}/nagios/plugins" \ --localstatedir="%{_localstatedir}/log/nagios" \ --sbindir="%{_libdir}/nagios/cgi" \ --sysconfdir="%{_sysconfdir}/nagios" \ --with-cgiurl="/nagios/cgi-bin" \ --with-command-user="apache" \ --with-command-grp="apache" \ --with-gd-lib="%{_libdir}" \ --with-gd-inc="%{_includedir}" \ --with-init-dir="%{_initrddir}" \ --with-htmurl="/nagios" \ --with-lockfile="%{_localstatedir}/run/nagios.pid" \ --with-mail="/bin/mail" \ --with-nagios-user="nagios" \ --with-nagios-grp="nagios" \ --with-template-objects \ --with-template-extinfo NSCA configure options: %configure \ --with-nsca-user="nagios" \ --with-nsca-grp="nagios" \ --with-nsca-port="5667" Library versions: # rpm -qa | grep -i lib glib2-2.2.3-2.0 libacl-2.2.3-1 cracklib-dicts-2.7-22 libtool-libs-1.4.3-6 elfutils-libelf-0.94-1 libuser-0.51.7-1.EL3.3 libpng-1.2.2-25 libart_lgpl-2.3.11-2 glibc-2.3.2-95.33 bzip2-libs-1.0.2-11.EL3.4 libstdc++-3.2.3-52 bind-libs-9.2.4-7_EL3 glibc-headers-2.3.2-95.33 cracklib-2.7-22 glib-1.2.10-11.1 libattr-2.2.0-1 libtermcap-2.0.8-35 zlib-1.1.4-8.1 libcap-1.10-15.1 libjpeg-6b-30 libmcrypt-2.5.7-1_ES3 libxml2-2.5.10-7 glibc-common-2.3.2-95.33 laus-libs-0.1-70RHEL3 rpm-libs-4.2.3-21_nonptl libgcc-3.2.3-52 krb5-libs-1.2.7-47 glibc-kernheaders-2.4-8.34.1 glibc-devel-2.3.2-95.33 Complete process list: UID PID PPID C STIME TTY TIME CMD root 1 0 0 16:27 ? 00:00:08 init root 2 1 0 16:27 ? 00:00:00 [keventd] root 3 1 0 16:27 ? 00:00:00 [kapmd] root 4 1 0 16:27 ? 00:00:00 [ksoftirqd/0] root 7 1 0 16:27 ? 00:00:00 [bdflush] root 5 1 0 16:27 ? 00:00:00 [kswapd] root 6 1 0 16:27 ? 00:01:19 [kscand] root 8 1 0 16:27 ? 00:00:04 [kupdated] root 9 1 0 16:28 ? 00:00:00 [mdrecoveryd] root 17 1 0 16:28 ? 00:00:01 [kjournald] root 554 1 0 16:28 ? 00:00:00 [kjournald] root 555 1 0 16:28 ? 00:00:00 [kjournald] root 556 1 0 16:28 ? 00:00:00 [kjournald] root 557 1 0 16:28 ? 00:00:01 [kjournald] root 558 1 0 16:28 ? 00:00:01 [kjournald] root 559 1 0 16:28 ? 00:00:03 [kjournald] root 716 1 0 16:28 ? 00:00:00 [vmmemctl] root 736 1 0 16:28 ? 00:00:01 /usr/sbin/vmware- guestd --background /var/run/vmware-guestd.pid root 894 1 0 16:29 ? 00:00:00 syslogd -m 0 root 898 1 0 16:29 ? 00:00:00 klogd -x root 927 1 0 16:29 ? 00:00:00 /usr/sbin/sshd root 956 1 0 16:29 ? 00:00:00 /bin/sh /usr/bin/safe_mysqld --defaults-file=/etc/my.cnf mysql 980 956 0 16:29 ? 00:02:13 /usr/libexec/mysqld -- defaults-file=/etc/my.cnf --basedir=/usr --datadir=/var/lib/mysql -- user=mysql --pid-file=/var/run/mysqld/mysqld.pid --skip-locking nagios 1013 1 0 16:29 ? 00:00:59 nsca -c /etc/nagios/nsca.cfg -d root 1064 1 0 16:29 ? 00:00:00 /usr/libexec/postfix/master nagios 1071 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d postfix 1075 1064 0 16:29 ? 00:00:00 nqmgr -l -n qmgr -t fifo -u nagios 1076 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1078 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1079 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1080 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1082 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1084 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1089 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1091 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1092 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1094 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1095 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1097 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1101 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1102 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1106 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1108 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1114 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1115 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1119 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1123 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1126 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1127 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1132 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1133 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1134 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1137 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1138 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1139 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1142 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1143 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1144 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1148 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1149 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1152 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1154 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1155 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1156 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1157 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1158 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1159 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1160 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1161 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1162 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1163 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1164 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1165 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1166 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1167 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1168 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1169 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1170 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1171 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1172 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1173 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1174 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1175 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1176 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1177 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1178 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1179 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1180 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1181 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1182 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1183 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1184 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1185 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d root 1186 1 0 16:29 ? 00:00:00 /usr/sbin/httpd nagios 1190 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1191 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1192 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1194 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1198 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1199 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d root 1201 1 0 16:29 ? 00:00:00 crond nagios 1203 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1204 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1208 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1210 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1212 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1213 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1216 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1217 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1218 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1219 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1220 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1221 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1223 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1224 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1225 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d root 1227 1 0 16:29 ? 00:00:00 cfservd nagios 1228 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1230 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1233 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1234 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1236 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1239 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1240 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1241 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1246 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1247 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1248 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1250 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1251 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1253 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1254 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1255 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1256 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1257 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1258 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1259 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1260 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1261 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1262 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1263 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1264 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d apache 1266 1186 0 16:29 ? 00:00:07 /usr/sbin/httpd apache 1268 1186 0 16:29 ? 00:00:06 /usr/sbin/httpd apache 1269 1186 0 16:29 ? 00:00:06 /usr/sbin/httpd apache 1270 1186 0 16:29 ? 00:00:06 /usr/sbin/httpd nagios 1271 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1272 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d apache 1273 1186 0 16:29 ? 00:00:07 /usr/sbin/httpd apache 1274 1186 0 16:29 ? 00:00:06 /usr/sbin/httpd apache 1275 1186 0 16:29 ? 00:00:06 /usr/sbin/httpd nagios 1276 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d apache 1277 1186 0 16:29 ? 00:00:06 /usr/sbin/httpd nagios 1278 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1279 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1281 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1282 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1283 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1284 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1285 1013 0 16:29 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 1345 1 2 16:29 ? 00:04:37 /usr/bin/nagios -d /etc/nagios/nagios.cfg root 1500 1 0 16:29 tty1 00:00:00 /sbin/mingetty tty1 root 1501 1 0 16:29 tty2 00:00:00 /sbin/mingetty tty2 root 1502 1 0 16:29 tty3 00:00:00 /sbin/mingetty tty3 root 1503 1 0 16:29 tty4 00:00:00 /sbin/mingetty tty4 root 1504 1 0 16:29 tty5 00:00:00 /sbin/mingetty tty5 root 1505 1 0 16:29 tty6 00:00:00 /sbin/mingetty tty6 apache 3705 1186 0 16:32 ? 00:00:07 /usr/sbin/httpd nagios 19439 1 0 16:54 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 22551 1 0 16:58 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 23280 1 0 16:59 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 25642 1 0 17:02 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 26058 1 0 17:03 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 27230 1 0 17:04 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 30478 1 0 17:09 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 31653 1 0 17:10 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 465 1 0 17:12 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 1935 1 0 17:14 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 5738 1 0 17:19 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 6068 1 0 17:20 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 7110 1 0 17:21 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 9233 1 0 17:24 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 9681 1 0 17:24 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 10437 1 0 17:25 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 13359 1 0 17:29 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 15349 1 0 17:32 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 16239 1 0 17:33 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 16562 1 0 17:34 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 17727 1 0 17:35 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 19382 1 0 17:37 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 20844 1 0 17:39 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 21588 1 0 17:40 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 24250 1 0 17:44 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 25264 1 0 17:45 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 26719 1 0 17:47 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 27292 1 0 17:48 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 28024 1 0 17:49 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 28350 1 0 17:49 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 28991 1 0 17:50 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 29804 1 0 17:51 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 30360 1 0 17:52 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 32666 1 0 17:55 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 1739 1 0 17:57 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 3424 1 0 17:59 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 4800 1 0 18:01 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 5382 1 0 18:02 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 6753 1 0 18:04 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 9300 1 0 18:07 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 9944 1 0 18:08 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 10442 1 0 18:09 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 11714 1 0 18:10 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 12669 1 0 18:12 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 14042 1 0 18:14 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 14693 1 0 18:15 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 18456 1 0 18:20 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 18765 1 0 18:20 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 18977 1 0 18:20 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 21475 1 0 18:24 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 22472 1 0 18:25 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 23420 1 0 18:26 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 23942 1 0 18:27 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 24208 1 0 18:27 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 25500 1 0 18:29 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 26188 1 0 18:30 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 26525 1 0 18:30 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 27637 1 0 18:32 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 27832 1 0 18:32 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 28961 1 0 18:34 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 29299 1 0 18:34 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 30387 1 0 18:36 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 31308 1 0 18:37 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 31799 1 0 18:38 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 32578 1 0 18:39 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 353 1 0 18:39 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 1477 1 0 18:40 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 2185 1 0 18:41 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 2718 1 0 18:42 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 3875 1 0 18:44 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 4868 1 0 18:45 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 5166 1 0 18:45 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 6330 1 0 18:47 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 6961 1 0 18:48 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 7669 1 0 18:49 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 7955 1 0 18:49 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 9401 1 0 18:51 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 11359 1 0 18:54 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 11461 1 0 18:54 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 12303 1 0 18:55 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 12500 1 0 18:55 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 13506 1 0 18:57 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 13777 1 0 18:57 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 14842 1 0 18:59 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 15841 1 0 19:00 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 16187 1 0 19:00 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 17191 1 0 19:02 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 18499 1 0 19:04 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 19145 1 0 19:05 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 19899 1 0 19:05 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 20915 1 0 19:07 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 21541 1 0 19:08 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg root 21792 21689 0 19:09 pts/0 00:00:00 ksu root 21812 21792 0 19:09 pts/0 00:00:06 /bin/bash nagios 21880 1 0 19:09 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 22341 1 0 19:09 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 23239 1 0 19:10 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 23334 1 0 19:10 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 23562 1 0 19:11 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 23977 1 0 19:11 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 24256 1 0 19:12 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 24668 1 0 19:12 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 24909 1 0 19:13 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 25543 1 0 19:14 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 25795 1 0 19:14 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 26371 1 0 19:15 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 27075 1 0 19:16 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 27824 1 0 19:17 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 27998 1 0 19:17 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 28446 1 0 19:18 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 28984 1 0 19:19 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 29100 1 0 19:19 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 29787 1 0 19:20 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 30128 1 0 19:20 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg ntp 30392 1 0 19:20 ? 00:00:00 ntpd -U ntp -p /var/run/ntpd.pid nagios 30479 1 0 19:20 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 31194 1 0 19:21 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 31507 1 0 19:22 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 31759 1 0 19:22 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 31889 1 0 19:22 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 32029 1 0 19:23 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 32204 1 0 19:23 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 321 1 0 19:24 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 459 1 0 19:24 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 644 1 0 19:24 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 913 1 0 19:25 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 999 1 0 19:25 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 1748 1 0 19:25 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 2016 1 0 19:26 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 2181 1 0 19:26 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 2763 1 0 19:27 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 3084 1 0 19:27 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 3182 1 0 19:27 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 3333 1 0 19:28 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 4094 1 0 19:29 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 4212 1 0 19:29 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 4794 1 0 19:30 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 5006 1 0 19:30 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 5458 1 0 19:31 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 5742 1 0 19:31 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 6092 1 0 19:32 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 6279 1 0 19:32 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 6443 1 0 19:32 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 6758 1 0 19:33 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 6969 1 0 19:33 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 7284 1 0 19:34 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 7733 1 0 19:34 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 8617 1 0 19:35 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 9232 1 0 19:36 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 9490 1 0 19:36 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 9732 1 0 19:37 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 9970 1 0 19:37 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 10401 1 0 19:38 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 10919 1 0 19:39 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 11052 1 0 19:39 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 11379 1 0 19:39 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 11602 1 0 19:40 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 11729 1 0 19:40 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 12111 1 0 19:40 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 12283 1 0 19:40 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 12438 1 0 19:41 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 12969 1 0 19:41 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 13662 1 0 19:42 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 13813 1 0 19:43 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 13960 1 0 19:43 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 14575 1 0 19:44 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 15088 1 0 19:45 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 15515 1 0 19:45 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 15748 1 0 19:45 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 15760 1 0 19:45 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 16054 1 0 19:46 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 16233 1 0 19:46 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 16417 1 0 19:46 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 16797 1 0 19:47 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 16996 1 0 19:47 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 17123 1 0 19:48 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg postfix 17708 1064 0 19:49 ? 00:00:00 pickup -l -t fifo -u nagios 17931 1 0 19:49 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 18015 1 0 19:49 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 18220 1 0 19:49 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 18691 1 0 19:50 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 18845 1 0 19:50 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 18939 1 0 19:50 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 19531 1 0 19:51 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 19626 1 0 19:51 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 19785 1 0 19:51 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 20204 1 0 19:52 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 20304 1 0 19:52 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 20594 1 0 19:53 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 20926 1 0 19:53 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 21030 1 0 19:53 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 21406 1 0 19:54 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 21641 1 0 19:54 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 21758 1 0 19:54 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 22176 1 0 19:55 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 22234 1 0 19:55 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 22566 1 0 19:55 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 22722 1 0 19:56 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 23255 1 0 19:56 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 23407 1 0 19:57 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 23590 1 0 19:57 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 24251 1 0 19:58 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 24406 1 0 19:58 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 24916 1 0 19:59 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 25248 1 0 19:59 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 25350 1 0 19:59 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 25424 1 0 20:00 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 25735 1 0 20:00 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 25852 1 0 20:00 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 26039 1 0 20:00 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 26575 1 0 20:01 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 26802 1 0 20:01 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 27155 1 0 20:02 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 27883 1 0 20:03 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 28249 1 0 20:04 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 28603 1 0 20:04 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 29548 1 0 20:05 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 29634 1 0 20:05 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 29685 1 0 20:05 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 30047 1 0 20:06 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 30195 1 0 20:06 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 30275 1 0 20:06 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg root 30396 1 0 20:07 ? 00:00:00 /usr/local/sbin/cfagent --inform -v root 30398 30396 0 20:07 ? 00:00:00 [ifconfig ] nagios 30425 1 0 20:07 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 30525 1 0 20:07 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 30592 1 0 20:07 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 30630 1 0 20:07 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 30736 1 0 20:07 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 30878 1 0 20:07 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 31502 1 0 20:08 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 31535 1 0 20:08 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 32265 1 0 20:09 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 32471 1 0 20:10 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 420 1 0 20:10 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 696 1 0 20:10 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 724 1 0 20:10 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 726 1 0 20:10 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 742 1 0 20:10 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 1121 1 0 20:11 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 1478 1 0 20:11 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 1711 1 0 20:12 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 1865 1 0 20:12 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 2361 1 0 20:13 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 2742 1 0 20:13 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 3091 1 0 20:14 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 3530 1 0 20:15 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 3694 1013 0 20:15 ? 00:00:00 nsca -c /etc/nagios/nsca.cfg -d nagios 3711 1345 0 20:15 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg # cat /var/run/nagios.pid 1345 ---- RPM Build environment details ---- Compiled with: # gcc -v Reading specs from /usr/lib/gcc-lib/i386-redhat-linux/3.2.3/specs Configured with: ../configure --prefix=/usr --mandir=/usr/share/man -- infodir=/usr/share/info --enable-shared --enable-threads=posix -- disable-checking --with-system-zlib --enable-__cxa_atexit --host=i386- redhat-linux Thread model: posix gcc version 3.2.3 20030502 (Red Hat Linux 3.2.3-20) Compiled on: RedHat ES 3 Update 3 Library versions: # rpm -qa | grep -i lib bzip2-libs-1.0.2-11 libtermcap-2.0.8-35 libjpeg-6b-30 libpng-1.2.2-16 libwvstreams-3.70-10 perl-libxml-perl-0.07-28 libcap-1.10-15 XFree86-Mesa-libGLU-4.3.0-35.EL libxml2-python-2.5.10-5 libglade2-2.0.1-3 libgnomeui-2.2.1-1 cdparanoia-libs-alpha9.8-15 libungif-4.1.0-15 kdelibs-3.1.3-6.2 libxml-1.8.17-9.1 glibc-profile-2.3.2-95.3 libaio-devel-0.3.96-3 libmudflap-devel-3.5ssa-0.20030801.41 libgnat-3.2.3-20 libstdc++-ssa-devel-3.5ssa-0.20030801.41 libxml2-devel-2.5.10-5 libgcj-ssa-devel-3.5ssa-0.20030801.41 libuser-devel-0.51.7-1 ant-libs-1.5.2-20 libao-devel-0.8.3-3 libpng-devel-1.2.2-16 libole2-devel-0.2.4-6 libIDL-devel-0.8.0-9 libgnomeprint22-devel-2.2.1.3-5 libgnome-devel-2.2.2-6 libgnomeui-devel-2.2.1-1 kdelibs-devel-3.1.3-6.2 glibc-common-2.3.2-95.3 elfutils-libelf-0.89-1 glib-1.2.10-11.1 libacl-2.2.3-1 cracklib-dicts-2.7-22 krb5-libs-1.2.7-19 libstdc++-3.2.3-20 XFree86-libs-data-4.3.0-35.EL libgcj-3.2.3-20 libtiff-3.5.7-13 rhnlib-1.3-12 XFree86-Mesa-libGL-4.3.0-35.EL perl-libxml-enno-1.02-29 libxml2-2.5.10-5 libart_lgpl-2.3.11-2 libIDL-0.8.0-9 libvorbis-1.0-7 libgnomecanvas-2.2.0.2-2 libwnck-2.2.1-3 libgnome-2.2.2-6 librsvg2-2.2.3-2 libghttp-1.0.9-8.1 libraw1394-0.9.0-9 imlib-1.9.13-12 libgail-gnome-1.0.2-1 libgnomeprintui22-2.2.1.3-2.0 libsoup-1.99.26-1 libgal2-1.99.10-1 libaio-0.3.96-3 libcap-devel-1.10-15 libmudflap-3.5ssa-0.20030801.41 libogg-devel-1.0-5.1 libgcc-ssa-3.5ssa-0.20030801.41 libstdc++-ssa-3.5ssa-0.20030801.41 libtermcap-devel-2.0.8-35 zlib-devel-1.1.4-8.1 libgcj-ssa-3.5ssa-0.20030801.41 libtool-1.4.3-6 glibc-kernheaders-2.4-8.34 glibc-devel-2.3.2-95.3 libjpeg-devel-6b-30 libmng-devel-1.0.4-3 libtiff-devel-3.5.7-13 libungif-devel-4.1.0-15 libole2-0.2.4-6 libart_lgpl-devel-2.3.11-2 glib2-devel-2.2.3-2.0 libxslt-devel-1.0.33-1 libbonobo-devel-2.2.3-1 gnome-libs-1.4.1.2.90-34.1 imlib-devel-1.9.13-12 libgnomecanvas-devel-2.2.0.2-2 libgnomeprintui22-devel-2.2.1.3-2.0 libbonoboui-devel-2.2.2-1 openoffice-libs-1.0.2-8 libmcrypt-2.5.7-1.dag.rhel3 libgcc-3.2.3-20 glibc-2.3.2-95.3 cracklib-2.7-22 glib2-2.2.3-2.0 libattr-2.2.0-1 zlib-1.1.4-8.1 libuser-0.51.7-1 libtool-libs-1.4.3-6 cups-libs-1.1.17-13.3.6 XFree86-libs-4.3.0-35.EL perl-libwww-perl-5.65-6 libxslt-1.0.33-1 VFlib2-2.25.6-17 libogg-1.0-5.1 libmng-1.0.4-3 libbonobo-2.2.3-1 pygtk2-libglade-1.99.16-8 libbonoboui-2.2.2-1 libgsf-1.6.0-4 libgtop2-2.0.2-1 libgnomeprint22-2.2.1.3-5 pwlib-1.4.7-6.EL libao-0.8.3-3 libusb-0.1.6-3 rh-postgresql-libs-7.3.4-8 libobjc-3.2.3-20 libf2c-3.2.3-20 libvorbis-devel-1.0-7 libstdc++-devel-3.2.3-20 libusb-devel-0.1.6-3 glibc-utils-2.3.2-95.3 libgcj-devel-3.2.3-20 glibc-headers-2.3.2-95.3 glib-devel-1.2.10-11.1 libpng10-1.0.13-8 gnome-libs-devel-1.4.1.2.90-34.1 libglade2-devel-2.0.1-3 librsvg2-devel-2.2.3-2 libmcrypt-devel-2.5.7-1.dag.rhel3 ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From misc at viceconsulting.co.nz Wed Sep 7 23:05:00 2005 From: misc at viceconsulting.co.nz (misc at viceconsulting.co.nz) Date: Thu, 8 Sep 2005 09:05:00 +1200 (NZST) Subject: Service check delays in distributed monitor setup Message-ID: <37392.127.0.0.1.1126127100.squirrel@www.goldenfields.co.nz> Hi Fred, I have encountered the exact same problem with my central Nagios server. It has about 1000 passive services, but only about 10 active services (the active services being used for the central Nagios server to self-monitor itself). The 1000 passive services receiving their results from the 5 distributed servers. When I restart the Central Nagios server, the active checks get scheduled for 3 hours+ into the future, but they never actually seem to run. For days the active checks have not actually been checking themselves. I tried changing the service_inter_check_delay_method to d for dumb, which appeared to schedule it when I expected (ie within about 5 mins after the restart) but it still didn't run them. Your idea of setting service_inter_check_delay_method=0.05 sounds good. I haven't had any luck getting the 10 or so active services checking on my central Nagios server. Is anyone able to confirm that this is a known problem in Nagios, is there a better workaround, is this to be fixed in 2.0 final? Fred, keep the list posted if you make further breakthroughs. Cheers Alex On 7 Sep 2005 at 11:03, Fred wrote: > I think I have found the source of my issue with distributed monitoring and > service checks. > > It turns out that if you enable distributed monitoring, even passive service > check definitions seem to get scheduled to run when nagios starts up. If > you have say 10350 services (give or take one) and use smart scheduling of > services, you could easily see 3+ hours between the time that the first service > is scheduled and the last one. Changing the smart schduling to "n" for > no delay causes the services to not be scheduled in the future, but by the > time nagios processes the entire configuration file, the start time is in > the past and I think nagios forgets about the service so it is never scheduled > again. > > I'm currently trying a service_inter_check_delay_method=0.05 which puts me > at about 3 minutes for 10,000+ services, which seems to be enough time for > nagios to startup and still have its first pending service scheduled in the > near future rather then the near past ... > > Does this make sense to anyone who has been messing with these configuration > settings? > > Is there a better way to do this? I.e., I would like for nagios to *not* > consider the passive checks in any scheduling. I actually only have a small > number of active checks which when run will populate the rest of the passive > checks for the entire cluster, the problem is that it seems the node that I > run these checks on is alphabetically *after* all of the other nodes so it > seems to be scheduled last and has services starting the furthest out. > > Thanks. > -FredC > > > > > > > ------------------------------------------------------- > SF.Net email is Sponsored by the Better Software Conference & EXPO > September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices > Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA > Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > > ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From potus98 at yahoo.com Wed Sep 7 23:29:52 2005 From: potus98 at yahoo.com (John Christian) Date: Wed, 7 Sep 2005 14:29:52 -0700 (PDT) Subject: perfparse, rrdtool, apan, other? Message-ID: <20050907212953.73610.qmail@web54703.mail.yahoo.com> What do you use for fancy reports? Since I'm new to the Nagios community, I'm interested in reading insights from you veterans before I charge down some path and find myself all alone. I need to send pretty monthly reports to the PHBs upstairs. They're mainly interested in uptime percentages they can forward to customers to demonstrate SLAs. I would also like the option of storing and graphing perforance data for things like disk utilization or memory usage. As usual, there are A LOT of options that might work. My experience in other areas has shown that although many open-source tools are available, only a few are widely used, alive, and maintained. I'm not interested in finding the perfect solution for my exact needs; instead, I'd like to find a good solution that meets many of my needs and is A) used by many other people, and B) relatively straightforward to setup. TIA for any input! -John ______________________________________________________ Click here to donate to the Hurricane Katrina relief effort. http://store.yahoo.com/redcross-donate3/ ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From psingh at inforelay.com Thu Sep 8 02:00:40 2005 From: psingh at inforelay.com (Paul Singh) Date: Wed, 7 Sep 2005 20:00:40 -0400 Subject: Notification Setup Question Message-ID: I'm trying to setup Nagios in such a way that I can be notified by email and pager during business hours. During the night, I'd like a notice to go to my email account first and then get escalated to my pager if the problem still exists 30 minutes later. Can someone get me started in the right direction? --Paul ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mnagel at willingminds.com Thu Sep 8 02:55:16 2005 From: mnagel at willingminds.com (Mark D. Nagel) Date: Wed, 07 Sep 2005 17:55:16 -0700 Subject: Notification Setup Question In-Reply-To: References: Message-ID: <431F8BF4.1090100@willingminds.com> Paul Singh wrote: > I'm trying to setup Nagios in such a way that I can be notified by > email and pager during business hours. During the night, I'd like a > notice to go to my email account first and then get escalated to my > pager if the problem still exists 30 minutes later. > > Can someone get me started in the right direction? > The trick is to define the multiple contact instances in different roles partitioned by time of day. Then you define one contactgroup listing all your roles and the right contact will be selected for the current time of day. Some untested potential examples follow, but your specific needs will likely be different. define contact { service_notification_period daytime_hours host_notification_period daytime_hours contact daytime_paul alias Paul - Daytime email paul at foo.com pager paulpager at foo.com service_notification_commands notify-by-email,notify-by-epager host_notification_commands host-notify-by-email,host-notify-by-epager } define contact { service_notification_period nighttime_hours host_notification_period nighttime_hours contact nighttime_paul alias Paul - Nighttime email paul at foo.com service_notification_commands notify-by-email host_notification_commands host-notify-by-email } define contact { service_notification_period nighttime_hours host_notification_period nighttime_hours contact nighttime_paul_pager alias Paul - Nighttime pager paulpager at foo.com service_notification_commands notify-by-epager host_notification_commands host-notify-by-epager } define contactgroup { contactgroup_name paul alias Paul 24x7 members daytime_paul,nighttime_paul } define hostgroupescalation { hostgroup_name some-host-group first_notification 2 last_notification 3 notification_interval 15 contact_groups paul,nighttime_paul_pager } This should do it -- email and page during the day, email only at night with an escalation to paging via a contact that only does paging at night. When daytime hits, the escalation will only contact 'paul', which includes paging as well during that time. Mark -- Mark D. Nagel, CCIE #3177 Principal Consultant, Willing Minds LLC (http://www.willingminds.com) tel: 714-630-4772, fax: 714-630-4773, fwd: 680979 ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From f1216 at yahoo.com Thu Sep 8 03:20:59 2005 From: f1216 at yahoo.com (Fred) Date: Wed, 7 Sep 2005 18:20:59 -0700 (PDT) Subject: Service check delays in distributed monitor setup In-Reply-To: <37392.127.0.0.1.1126127100.squirrel@www.goldenfields.co.nz> References: <37392.127.0.0.1.1126127100.squirrel@www.goldenfields.co.nz> Message-ID: <20050908012059.95096.qmail@web31906.mail.mud.yahoo.com> Unfortunately, setting the increment to a small number only worked to set the pending state to something that looked reasonable, however, the services still never get scheduled. My configuration *was* working at one point, I tweaked something and now no matter what I do, I can't get it to start monitoring again. My passive checks recieved from other monitor nodes all seem to get registered, its just the active checks that run on the master (head) node never see the light of day any more. If I regenerate the configuration to not use distributed monitoring, it works just fine, however, that puts way too much pressure on a single node. I removed the status.sav, but as I type this I'm thinking I should nuke all the cache files that nagios builds, maybe there is something that got munged in there ... We've used both Nagios 1.2 and now 2.0b3 (testing 2.0b4) and I have yet to need to crack open the source and make any mods ... looks like that time is coming ;-) -FredC --- misc at viceconsulting.co.nz wrote: > Hi Fred, > > I have encountered the exact same problem with my central Nagios server. > It has about 1000 passive services, but only about 10 active services (the > active services being used for the central Nagios server to self-monitor > itself). The 1000 passive services receiving their results from the 5 > distributed servers. > > When I restart the Central Nagios server, the active checks get scheduled > for 3 hours+ into the future, but they never actually seem to run. For > days the active checks have not actually been checking themselves. > > I tried changing the service_inter_check_delay_method to d for dumb, which > appeared to schedule it when I expected (ie within about 5 mins after the > restart) but it still didn't run them. > > Your idea of setting service_inter_check_delay_method=0.05 sounds good. I > haven't had any luck getting the 10 or so active services checking on my > central Nagios server. > > Is anyone able to confirm that this is a known problem in Nagios, is there > a better workaround, is this to be fixed in 2.0 final? > > Fred, keep the list posted if you make further breakthroughs. > > Cheers > Alex > > On 7 Sep 2005 at 11:03, Fred wrote: > > > I think I have found the source of my issue with distributed monitoring and > > service checks. > > > > It turns out that if you enable distributed monitoring, even passive > service > > check definitions seem to get scheduled to run when nagios starts up. If > > you have say 10350 services (give or take one) and use smart scheduling of > > services, you could easily see 3+ hours between the time that the first > service > > is scheduled and the last one. Changing the smart schduling to "n" for > > no delay causes the services to not be scheduled in the future, but by the > > time nagios processes the entire configuration file, the start time is in > > the past and I think nagios forgets about the service so it is never > scheduled > > again. > > > > I'm currently trying a service_inter_check_delay_method=0.05 which puts me > > at about 3 minutes for 10,000+ services, which seems to be enough time for > > nagios to startup and still have its first pending service scheduled in the > > near future rather then the near past ... > > > > Does this make sense to anyone who has been messing with these > configuration > > settings? > > > > Is there a better way to do this? I.e., I would like for nagios to *not* > > consider the passive checks in any scheduling. I actually only have a > small > > number of active checks which when run will populate the rest of the > passive > > checks for the entire cluster, the problem is that it seems the node that I > > run these checks on is alphabetically *after* all of the other nodes so it > > seems to be scheduled last and has services starting the furthest out. > > > > Thanks. > > -FredC > > > > > > > > > > > > > > ------------------------------------------------------- > > SF.Net email is Sponsored by the Better Software Conference & EXPO > > September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices > > Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA > > Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf > > _______________________________________________ > > Nagios-users mailing list > > Nagios-users at lists.sourceforge.net > > https://lists.sourceforge.net/lists/listinfo/nagios-users > > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > > ::: Messages without supporting info will risk being sent to /dev/null > > > > > > > > > > ------------------------------------------------------- > SF.Net email is Sponsored by the Better Software Conference & EXPO > September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices > Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA > Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting > any issue. > ::: Messages without supporting info will risk being sent to /dev/null > ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From xavier at rootshell.be Thu Sep 8 07:55:10 2005 From: xavier at rootshell.be (Xavier) Date: Thu, 8 Sep 2005 07:55:10 +0200 (CEST) Subject: Suggestions for alerts Message-ID: Hi, I would like to explain a "people"" problem I'm facing with... I'm using Nagios to monitor the whole IT platform inside my company. Works perfectly but... (there is always a "but..." ;-) Some engineers who receive alerts (sms or mail) would like to receive more "friendly" messages. Ex: instead of "swb.bru is down", they would like to receive "Master switch @ Colo down, BLES problem? Call +1 xxxxxx" Any suggestion? Xavier -- "Research is what I'm doing when I don't know what I'm doing." - Wernher Von Braun (1912-1977) ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From KleinBiesel at aol.com Thu Sep 8 08:32:42 2005 From: KleinBiesel at aol.com (KleinBiesel at aol.com) Date: Thu, 08 Sep 2005 02:32:42 -0400 Subject: Nagios works, but statusmap not!!!!!!!!!!!!!!!!! Message-ID: <7C27510F.592C674D.3B30519B@aol.com> hi all! i? m new to 'SourceForge', so i dont really know how this works.. my problem: we?ve got a new server. on this server, it was my job to install nagios. nagios worked greatly on the old server, so we packed the directory of the running nagios and tryd to unpack and config it on the new server. nagios -v gives no errors or warnings, but the function 'status map' gives an internal error and the error.log of my apache writes this: [Thu Sep 01 13:47:44 2005] [error] [client 192.168.0.121] /usr/local/nagios/sbin/statusmap.cgi: symbol lookup error: /opt/lampp/lib/libgd.so.2: undefined symbol: FT_New_Face, referer: http://alpha/xampp/share/side.html [Thu Sep 01 13:47:44 2005] [error] [client 192.168.0.121] Premature end of script headers: statusmap.cgi, referer: http://alpha/xampp/share/side.html i think my httpd.conf is right, because it worked an the old server. the only thing whats is, that i use an other apache-software is there any file i should for to check a special entry? is there any script i should that the configuration of nagios get the sources of my new apache-software? please dont tell that i should nagios reinstall. but thx for any help chronometer ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From al at its-lehmann.de Thu Sep 8 09:07:34 2005 From: al at its-lehmann.de (Arno Lehmann) Date: Thu, 08 Sep 2005 09:07:34 +0200 Subject: Nagios works, but statusmap not!!!!!!!!!!!!!!!!! In-Reply-To: <7C27510F.592C674D.3B30519B@aol.com> References: <7C27510F.592C674D.3B30519B@aol.com> Message-ID: <431FE336.8020809@its-lehmann.de> Hello, KleinBiesel at aol.com wrote: > hi all! > > i? m new to 'SourceForge', so i dont really know how this works.. > my problem: > we?ve got a new server. on this server, it was my job to install nagios. > nagios worked greatly on the old server, so we packed the directory of the running nagios and tryd to unpack and config it on the new server. nagios -v > gives no errors or warnings, but the function 'status map' gives an internal error and the error.log of my apache writes this: > > [Thu Sep 01 13:47:44 2005] [error] [client 192.168.0.121] /usr/local/nagios/sbin/statusmap.cgi: symbol lookup error: /opt/lampp/lib/libgd.so.2: undefined symbol: FT_New_Face, referer: http://alpha/xampp/share/side.html > [Thu Sep 01 13:47:44 2005] [error] [client 192.168.0.121] Premature end of script headers: statusmap.cgi, referer: http://alpha/xampp/share/side.html This looks like the necessary gl libraries are nt available. Make sure they are installed, verify the dinymic linker setup, and try again. If this doesn't help, installing the necessary development packages and recompiling on your new machine might be the best solution. Arno > i think my httpd.conf is right, because it worked an the old server. the only thing whats is, that i use an other apache-software > > is there any file i should for to check a special entry? > is there any script i should that the configuration of nagios get the sources of my new apache-software? > please dont tell that i should nagios reinstall. > > but thx for any help > chronometer > > > ------------------------------------------------------- > SF.Net email is Sponsored by the Better Software Conference & EXPO > September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices > Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA > Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -- IT-Service Lehmann al at its-lehmann.de Arno Lehmann http://www.its-lehmann.de ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ae at op5.se Thu Sep 8 09:12:09 2005 From: ae at op5.se (Andreas Ericsson) Date: Thu, 08 Sep 2005 09:12:09 +0200 Subject: Suggestions for alerts In-Reply-To: References: Message-ID: <431FE449.6030401@op5.se> Xavier wrote: > Hi, > > I would like to explain a "people"" problem I'm facing with... > I'm using Nagios to monitor the whole IT platform inside my company. > Works perfectly but... (there is always a "but..." ;-) > > Some engineers who receive alerts (sms or mail) would like to > receive more "friendly" messages. > > Ex: instead of "swb.bru is down", they would like to receive > "Master switch @ Colo down, BLES problem? Call +1 xxxxxx" > > Any suggestion? > Attach the manual of the device in question to each and every failure-message. Seriously though. You can use the 'alias' field for hosts instead of, or together with, the host_name. The macro is $HOSTALIAS$. As for making suggestions to fixes, they should try the neo-cortex. It's a wonderfully adaptive tool based on neural-net technology and genetic programming. -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Lead Developer ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Ralph.Grothe at itdz-berlin.de Thu Sep 8 09:29:11 2005 From: Ralph.Grothe at itdz-berlin.de (Ralph.Grothe at itdz-berlin.de) Date: Thu, 8 Sep 2005 09:29:11 +0200 Subject: Suggestions for alerts Message-ID: <6B893C5F2902D311A23F0090272854FB0575B7FC@litex001.lit.verwalt-berlin.de> Hi Xavier, as an experienced Nagios user (unlike me) you probably know already that the contents and formatting of your alert notifications is entirely up to you, and that you can easily change predefined notification command definitions or add your own. With my configuration for example those are defined in NAGIOS_ROOT/etc/misccommands.cfg, and they are referenced in the various host and service definitions by the event_handler attribute. For more descriptive or "polite" message texts you can spice them up with the abundance of preset host and service macros or even your own user defined macros. Just have a dekko at your own config files to get an idea. Regards Ralph > -----Original Message----- > From: nagios-users-admin at lists.sourceforge.net > [mailto:nagios-users-admin at lists.sourceforge.net]On Behalf Of Xavier > Sent: Thursday, September 08, 2005 7:55 AM > To: Nagios Users > Subject: [Nagios-users] Suggestions for alerts > > > Hi, > > I would like to explain a "people"" problem I'm facing with... > I'm using Nagios to monitor the whole IT platform inside my company. > Works perfectly but... (there is always a "but..." ;-) > > Some engineers who receive alerts (sms or mail) would like to > receive more "friendly" messages. > > Ex: instead of "swb.bru is down", they would like to receive > "Master switch @ Colo down, BLES problem? Call +1 xxxxxx" > > Any suggestion? > > Xavier > -- > "Research is what I'm doing when I don't know what I'm doing." > - Wernher Von Braun (1912-1977) > > > ------------------------------------------------------- > SF.Net email is Sponsored by the Better Software Conference & EXPO > September 19-22, 2005 * San Francisco, CA * Development > Lifecycle Practices > Agile & Plan-Driven Development * Managing Projects & Teams * > Testing & QA > Security * Process Improvement & Measurement * > http://www.sqe.com/bsce5sf > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS > when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From morten.guldager at gmail.com Thu Sep 8 10:12:21 2005 From: morten.guldager at gmail.com (Morten Guldager) Date: Thu, 8 Sep 2005 10:12:21 +0200 Subject: Reading state of a contactgroup? Message-ID: Aloha! I have divided our objects in 3 categories: unix-oper, windows-oper and network-oper. Then I have defined 3 corresponding contactgroup's. Now I would like to implement 3 "traffic light's" in our main control center. Red : One or more UN-acknowledged Critical errors Yellow : One or more acknowledged Critical error Green : No errors But I cant figure out where read the state information in nagios. Maybe it does not exists at all. (on the contactgroup level) Suggestions? -- /Morten %-) ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From barbereau at gmail.com Thu Sep 8 10:32:35 2005 From: barbereau at gmail.com (=?ISO-8859-1?Q?S=E9bastien_Barbereau?=) Date: Thu, 8 Sep 2005 10:32:35 +0200 Subject: Reading state of a contactgroup? In-Reply-To: References: Message-ID: <4e0e33ee050908013254e776c0@mail.gmail.com> Hi, I think you should have a look at the "status_file" variable from your main configuration file ... On 9/8/05, Morten Guldager wrote: > > Aloha! > > I have divided our objects in 3 categories: unix-oper, windows-oper > and network-oper. > > Then I have defined 3 corresponding contactgroup's. > > Now I would like to implement 3 "traffic light's" in our main control > center. > > Red : One or more UN-acknowledged Critical errors > Yellow : One or more acknowledged Critical error > Green : No errors > > But I cant figure out where read the state information in nagios. > Maybe it does not exists at all. (on the contactgroup level) > > Suggestions? > > -- > /Morten %-) > > > ------------------------------------------------------- > SF.Net email is Sponsored by the Better Software Conference & EXPO > September 19-22, 2005 * San Francisco, CA * Development Lifecycle > Practices > Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA > Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -------------- next part -------------- An HTML attachment was scrubbed... URL: From morten.guldager at gmail.com Thu Sep 8 10:50:37 2005 From: morten.guldager at gmail.com (Morten Guldager) Date: Thu, 8 Sep 2005 10:50:37 +0200 Subject: Reading state of a contactgroup? In-Reply-To: <4e0e33ee050908013254e776c0@mail.gmail.com> References: <4e0e33ee050908013254e776c0@mail.gmail.com> Message-ID: On 9/8/05, S?bastien Barbereau wrote: > > I think you should have a look at the "status_file" variable from your main > configuration file ... As far as I can tell the status_file only contains information on hosts and services. I need something "collected". My 3 contactgroup's contains multiple objects each, I need only one traffic light for each contactgroup, telling something about the "worst" condition in that particular group. /Morten %-) ---------------------------------------------------------------------------- > On 9/8/05, Morten Guldager wrote: > > Aloha! > > > > I have divided our objects in 3 categories: unix-oper, windows-oper > > and network-oper. > > > > Then I have defined 3 corresponding contactgroup's. > > > > Now I would like to implement 3 "traffic light's" in our main control > center. > > > > Red : One or more UN-acknowledged Critical errors > > Yellow : One or more acknowledged Critical error > > Green : No errors > > > > But I cant figure out where read the state information in nagios. > > Maybe it does not exists at all. (on the contactgroup level) > > > > Suggestions? > > > > -- > > /Morten %-) > > > > > > ------------------------------------------------------- > > SF.Net email is Sponsored by the Better Software Conference & EXPO > > September 19-22, 2005 * San Francisco, CA * Development Lifecycle > Practices > > Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA > > Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf > > _______________________________________________ > > Nagios-users mailing list > > Nagios-users at lists.sourceforge.net > > https://lists.sourceforge.net/lists/listinfo/nagios-users > > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > > ::: Messages without supporting info will risk being sent to /dev/null > > > > -- /Morten %-) ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ae at op5.se Thu Sep 8 11:04:12 2005 From: ae at op5.se (Andreas Ericsson) Date: Thu, 08 Sep 2005 11:04:12 +0200 Subject: Reading state of a contactgroup? In-Reply-To: References: <4e0e33ee050908013254e776c0@mail.gmail.com> Message-ID: <431FFE8C.4040402@op5.se> Morten Guldager wrote: > On 9/8/05, S?bastien Barbereau wrote: > >>I think you should have a look at the "status_file" variable from your main >>configuration file ... > > > As far as I can tell the status_file only contains information on > hosts and services. I need something "collected". > My 3 contactgroup's contains multiple objects each, I need only one > traffic light for each contactgroup, telling something about the > "worst" condition in that particular group. > Assuming the contacts in those different contactgroups are only contacts for the hosts they manage, the current Nagios gui will do this for you (but in greater detail) on the tactical overview, so why fiddle with extra scripts? > > /Morten %-) > ---------------------------------------------------------------------------- > >>On 9/8/05, Morten Guldager wrote: >> >>>Aloha! >>> >>>I have divided our objects in 3 categories: unix-oper, windows-oper >>>and network-oper. >>> >>>Then I have defined 3 corresponding contactgroup's. >>> >>>Now I would like to implement 3 "traffic light's" in our main control >> >>center. >> >>>Red : One or more UN-acknowledged Critical errors >>>Yellow : One or more acknowledged Critical error >>>Green : No errors >>> >>>But I cant figure out where read the state information in nagios. >>>Maybe it does not exists at all. (on the contactgroup level) >>> >>>Suggestions? >>> >>>-- >>>/Morten %-) >>> >>> >>>------------------------------------------------------- >>>SF.Net email is Sponsored by the Better Software Conference & EXPO >>>September 19-22, 2005 * San Francisco, CA * Development Lifecycle >> >>Practices >> >>>Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA >>>Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf >>>_______________________________________________ >>>Nagios-users mailing list >>>Nagios-users at lists.sourceforge.net >>>https://lists.sourceforge.net/lists/listinfo/nagios-users >>>::: Please include Nagios version, plugin version (-v) and OS when >> >>reporting any issue. >> >>>::: Messages without supporting info will risk being sent to /dev/null >>> >> >> > > > -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Lead Developer ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From chandresh.suthar at gmail.com Thu Sep 8 11:11:32 2005 From: chandresh.suthar at gmail.com (Chandresh Suthar) Date: Thu, 8 Sep 2005 14:41:32 +0530 Subject: please help me In-Reply-To: <9e938faf0509070949202407f8@mail.gmail.com> References: <9e938faf0509070949202407f8@mail.gmail.com> Message-ID: yes It's working. So why i m not getting any alerts ? I didn't find any failure messages in maillogs. Chandresh On 9/7/05, Zac wrote: > > Just to confirm, if you copy and paste : > /usr/bin/printf "%b" "***** Nagios *****\n\nNotification Type: > $NOTIFICATIONTYPE$\nHost: $HOSTNAME$\nState: $HOSTSTATE$\nAddress: > $HOSTADDRESS$\nInfo: $OUTPUT$\n\nDate/Time: $LONGDATETIME$\n" | /bin/mail -s > "Host $HOSTSTATE$ alert for $HOSTNAME$!" $CONTACTEMAIL$ > > and replace the $*$ with test, except for $CONTACTEMAIL$ which should be > the full e-mail address of where the alert is going. Everything works? As > far as I can tell your config looks good. If this works at the command line, > you should check your mail logs. Hope this helps. > > -Zac > > On 9/7/05, Chandresh Suthar wrote: > > > > Hi, > > Please help me in sending notifications. I have only one configuration > > file for contactfgroup,hostsgroup etc. "minimal.cfg" > > minimal.cfg : > > define timeperiod{ > > timeperiod_name 24x7 > > alias 24 Hours A Day, 7 Days A Week > > sunday 00:00-24:00 > > monday 00:00-24:00 > > tuesday 00:00-24:00 > > wednesday 00:00-24:00 > > thursday 00:00-24:00 > > friday 00:00-24:00 > > saturday 00:00-24:00 > > } > > > > define command{ > > command_name notify-by-email > > command_line /usr/bin/printf "%b" "***** Nagios *****\n\nNotification > > Type: $NOTIFICATIONTYPE$\n\nService: $SERVICEDESC$\nHost: > > $HOSTALIAS$\nAddress: $HOSTADDRESS$\nState: $SERVICESTATE$\n\nDate/Time: > > $LONGDATETIME$\n\nAdditional Info:\n\n$OUTPUT$" | /bin/mail -s "** > > $NOTIFICATIONTYPE$ alert - $HOSTALIAS$/$SERVICEDESC$ is $SERVICESTATE$ **" > > $CONTACTEMAIL$ > > } > > > > > > # This is a sample host notification command that can be used to send > > email > > # notifications (about host alerts) to contacts. > > > > define command{ > > command_name host-notify-by-email > > command_line /usr/bin/printf "%b" "***** Nagios *****\n\nNotification > > Type: $NOTIFICATIONTYPE$\nHost: $HOSTNAME$\nState: $HOSTSTATE$\nAddress: > > $HOSTADDRESS$\nInfo: $OUTPUT$\n\nDate/Time: $LONGDATETIME$\n" | /bin/mail -s > > "Host $HOSTSTATE$ alert for $HOSTNAME$!" $CONTACTEMAIL$ > > } > > > > define contact{ > > contact_name nagios > > alias Nagios Admin > > service_notification_period 24x7 > > host_notification_period 24x7 > > service_notification_options w,u,c,r > > host_notification_options d,r > > service_notification_commands notify-by-email > > host_notification_commands host-notify-by-email > > email nagios at localhost > > } > > > > define contact{ > > contact_name chandresh > > alias Administrator > > service_notification_period 24x7 > > host_notification_period 24x7 > > service_notification_options w,u,c,r > > host_notification_options d,r > > service_notification_commands notify-by-email > > host_notification_commands host-notify-by-email > > email "personalemail id" > > } > > > > define contactgroup{ > > contactgroup_name admins > > alias Nagios Administrators > > members nagios > > } > > > > define contactgroup{ > > contactgroup_name admingrp > > alias Administrators > > members chandresh > > } > > > > define host{ > > name generic-host ; The name of this host template > > notifications_enabled 1 ; Host notifications are enabled > > event_handler_enabled 1 ; Host event handler is enabled > > flap_detection_enabled 1 ; Flap detection is enabled > > failure_prediction_enabled 1 ; Failure prediction is enabled > > process_perf_data 1 ; Process performance data > > retain_status_information 1 ; Retain status information across program > > restarts > > retain_nonstatus_information 1 ; Retain non-status information across > > program restarts > > register 0 ; DONT REGISTER THIS DEFINITION - ITS NOT A REAL HOST, JUST A > > TEMPLATE! > > } > > > > > > define host{ > > use generic-host ; Name of host template to use > > host_name webmail.com > > alias localhost > > address 127.0.0.1 > > check_command check-host-alive > > max_check_attempts 10 > > notification_interval 120 > > notification_period 24x7 > > notification_options d,r > > contact_groups admingrp > > } > > define service{ > > use generic-service ; Name of service template to use > > host_name webmail.com > > service_description HTTP > > is_volatile 0 > > check_period 24x7 > > max_check_attempts 4 > > normal_check_interval 5 > > retry_check_interval 1 > > contact_groups admingrp > > notification_interval 960 > > notification_period 24x7 > > check_command check_http > > } > > > > I am able to send mails to my persoanl email id by "mail" command. But > > notification is not going. I have configured sendmail. I am not even > > receiving mails on local system at nagios at localhost( I tried). I am even > > not getting any error messages in logs of nagios. > > > > Please help. > > > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From jjk_saji at yahoo.com Thu Sep 8 11:37:35 2005 From: jjk_saji at yahoo.com (John Joseph) Date: Thu, 8 Sep 2005 10:37:35 +0100 (BST) Subject: please help me In-Reply-To: References: Message-ID: <20050908093735.48917.qmail@web40810.mail.yahoo.com> Hi I also tried the command /usr/bin/printf "%b" "***** Nagios *****\n\nNotification Type: $NOTIFICATIONTYPE$\nHost: $HOSTNAME$\nState: $HOSTSTATE$\nAddress: $HOSTADDRESS$\nInfo: $OUTPUT$\n\nDate/Time: $LONGDATETIME$\n" | /bin/mail -s "Host $HOSTSTATE$ alert for $HOSTNAME$!" joseph at nagtest.com in the command mode , the message did not go, it gave error as -bash: !": event not found Then I removed the ! after $HOSTNAME$ and tried /usr/bin/printf "%b" "***** Nagios *****\n\nNotification Type: $NOTIFICATIONTYPE$\nHost: $HOSTNAME$\nState: $HOSTSTATE$\nAddress: $HOSTADDRESS$\nInfo: $OUTPUT$\n\nDate/Time: $LONGDATETIME$\n" | /bin/mail -s "Host $HOSTSTATE$ alert for $HOSTNAME$" joseph at nagtest.com The mail went , Shalle I remove the ! parameter from the "$HOSTNAME$!" and try it on nagios I need advice Thanks Joseph John --- Chandresh Suthar wrote: > yes It's working. So why i m not getting any alerts > ? I didn't find any > failure messages in maillogs. > > Chandresh > > On 9/7/05, Zac wrote: > > > > Just to confirm, if you copy and paste : > > /usr/bin/printf "%b" "***** Nagios > *****\n\nNotification Type: > > $NOTIFICATIONTYPE$\nHost: $HOSTNAME$\nState: > $HOSTSTATE$\nAddress: > > $HOSTADDRESS$\nInfo: $OUTPUT$\n\nDate/Time: > $LONGDATETIME$\n" | /bin/mail -s > > "Host $HOSTSTATE$ alert for $HOSTNAME$!" > $CONTACTEMAIL$ > > > > and replace the $*$ with test, except for > $CONTACTEMAIL$ which should be > > the full e-mail address of where the alert is > going. Everything works? As > > far as I can tell your config looks good. If this > works at the command line, > > you should check your mail logs. Hope this helps. > > > > -Zac > > > > On 9/7/05, Chandresh Suthar > wrote: > > > > > > Hi, > > > Please help me in sending notifications. I have > only one configuration > > > file for contactfgroup,hostsgroup etc. > "minimal.cfg" > > > minimal.cfg : > > > define timeperiod{ > > > timeperiod_name 24x7 > > > alias 24 Hours A Day, 7 Days A Week > > > sunday 00:00-24:00 > > > monday 00:00-24:00 > > > tuesday 00:00-24:00 > > > wednesday 00:00-24:00 > > > thursday 00:00-24:00 > > > friday 00:00-24:00 > > > saturday 00:00-24:00 > > > } > > > > > > define command{ > > > command_name notify-by-email > > > command_line /usr/bin/printf "%b" "***** Nagios > *****\n\nNotification > > > Type: $NOTIFICATIONTYPE$\n\nService: > $SERVICEDESC$\nHost: > > > $HOSTALIAS$\nAddress: $HOSTADDRESS$\nState: > $SERVICESTATE$\n\nDate/Time: > > > $LONGDATETIME$\n\nAdditional Info:\n\n$OUTPUT$" > | /bin/mail -s "** > > > $NOTIFICATIONTYPE$ alert - > $HOSTALIAS$/$SERVICEDESC$ is $SERVICESTATE$ **" > > > $CONTACTEMAIL$ > > > } > > > > > > > > > # This is a sample host notification command > that can be used to send > > > email > > > # notifications (about host alerts) to contacts. > > > > > > define command{ > > > command_name host-notify-by-email > > > command_line /usr/bin/printf "%b" "***** Nagios > *****\n\nNotification > > > Type: $NOTIFICATIONTYPE$\nHost: > $HOSTNAME$\nState: $HOSTSTATE$\nAddress: > > > $HOSTADDRESS$\nInfo: $OUTPUT$\n\nDate/Time: > $LONGDATETIME$\n" | /bin/mail -s > > > "Host $HOSTSTATE$ alert for $HOSTNAME$!" > $CONTACTEMAIL$ > > > } > > > > > > define contact{ > > > contact_name nagios > > > alias Nagios Admin > > > service_notification_period 24x7 > > > host_notification_period 24x7 > > > service_notification_options w,u,c,r > > > host_notification_options d,r > > > service_notification_commands notify-by-email > > > host_notification_commands host-notify-by-email > > > email nagios at localhost > > > } > > > > > > define contact{ > > > contact_name chandresh > > > alias Administrator > > > service_notification_period 24x7 > > > host_notification_period 24x7 > > > service_notification_options w,u,c,r > > > host_notification_options d,r > > > service_notification_commands notify-by-email > > > host_notification_commands host-notify-by-email > > > email "personalemail id" > > > } > > > > > > define contactgroup{ > > > contactgroup_name admins > > > alias Nagios Administrators > > > members nagios > > > } > > > > > > define contactgroup{ > > > contactgroup_name admingrp > > > alias Administrators > > > members chandresh > > > } > > > > > > define host{ > > > name generic-host ; The name of this host > template > > > notifications_enabled 1 ; Host notifications are > enabled > > > event_handler_enabled 1 ; Host event handler is > enabled > > > flap_detection_enabled 1 ; Flap detection is > enabled > > > failure_prediction_enabled 1 ; Failure > prediction is enabled > > > process_perf_data 1 ; Process performance data > > > retain_status_information 1 ; Retain status > information across program > > > restarts > > > retain_nonstatus_information 1 ; Retain > non-status information across > > > program restarts > > > register 0 ; DONT REGISTER THIS DEFINITION - ITS > NOT A REAL HOST, JUST A > > > TEMPLATE! > > > } > > > > > > > > > define host{ > > > use generic-host ; Name of host template to use > > > host_name webmail.com > > > alias localhost > > > address 127.0.0.1 > > > check_command check-host-alive > > > max_check_attempts 10 > > > notification_interval 120 > > > notification_period 24x7 > > > notification_options d,r > > > contact_groups admingrp > > > } > > > define service{ > > > use generic-service ; Name of service template > to use > > > host_name webmail.com > > > service_description HTTP > > > is_volatile 0 > > > check_period 24x7 > > > max_check_attempts 4 > > > normal_check_interval 5 > > > retry_check_interval 1 > > > contact_groups admingrp > > > notification_interval 960 > > > notification_period 24x7 > > > check_command check_http > > > } > > > > > > I am able to send mails to my persoanl email id > by "mail" command. But > > > notification is not going. I have configured > sendmail. I am not even > > > receiving mails on local system at > nagios at localhost( I tried). I am even > > > not getting any error messages in logs of > nagios. > > > > > > Please help. > > > > > > > > ___________________________________________________________ How much free photo storage do you get? Store your holiday snaps for FREE with Yahoo! Photos http://uk.photos.yahoo.com ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From morten.guldager at gmail.com Thu Sep 8 11:52:51 2005 From: morten.guldager at gmail.com (Morten Guldager) Date: Thu, 8 Sep 2005 11:52:51 +0200 Subject: Reading state of a contactgroup? In-Reply-To: <431FFE8C.4040402@op5.se> References: <4e0e33ee050908013254e776c0@mail.gmail.com> <431FFE8C.4040402@op5.se> Message-ID: On 9/8/05, Andreas Ericsson wrote: > Morten Guldager wrote: > > On 9/8/05, S?bastien Barbereau wrote: > > > >>I think you should have a look at the "status_file" variable from your main > >>configuration file ... > > > > > > As far as I can tell the status_file only contains information on > > hosts and services. I need something "collected". > > My 3 contactgroup's contains multiple objects each, I need only one > > traffic light for each contactgroup, telling something about the > > "worst" condition in that particular group. > > > > Assuming the contacts in those different contactgroups are only contacts > for the hosts they manage, the current Nagios gui will do this for you > (but in greater detail) on the tactical overview, so why fiddle with > extra scripts? My "traffic lights" are not webbased. We are talking real analog hardware here. I know how to manipulate that part, I just cant figure out where to get the information from nagios. My current "best idea", (which sucks!), is to make a contact with a notification which logs events to a file. Then a daemon listens to this file and inserts changes to a database. "OK" states removes from the database. Then its a simple select to figure out the collective status for a contactgroup. But I think I went to the moon and back again for allmost nothing! /Morten %-) > > ---------------------------------------------------------------------------- > > > >>On 9/8/05, Morten Guldager wrote: > >> > >>>Aloha! > >>> > >>>I have divided our objects in 3 categories: unix-oper, windows-oper > >>>and network-oper. > >>> > >>>Then I have defined 3 corresponding contactgroup's. > >>> > >>>Now I would like to implement 3 "traffic light's" in our main control > >> > >>center. > >> > >>>Red : One or more UN-acknowledged Critical errors > >>>Yellow : One or more acknowledged Critical error > >>>Green : No errors > >>> > >>>But I cant figure out where read the state information in nagios. > >>>Maybe it does not exists at all. (on the contactgroup level) > >>> > >>>Suggestions? > >>> > >>>-- > >>>/Morten %-) > >>> > >>> > >>>------------------------------------------------------- > >>>SF.Net email is Sponsored by the Better Software Conference & EXPO > >>>September 19-22, 2005 * San Francisco, CA * Development Lifecycle > >> > >>Practices > >> > >>>Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA > >>>Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf > >>>_______________________________________________ > >>>Nagios-users mailing list > >>>Nagios-users at lists.sourceforge.net > >>>https://lists.sourceforge.net/lists/listinfo/nagios-users > >>>::: Please include Nagios version, plugin version (-v) and OS when > >> > >>reporting any issue. > >> > >>>::: Messages without supporting info will risk being sent to /dev/null > >>> > >> > >> > > > > > > > > -- > Andreas Ericsson andreas.ericsson at op5.se > OP5 AB www.op5.se > Lead Developer > > > ------------------------------------------------------- > SF.Net email is Sponsored by the Better Software Conference & EXPO > September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices > Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA > Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -- /Morten %-) ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ae at op5.se Thu Sep 8 14:01:25 2005 From: ae at op5.se (Andreas Ericsson) Date: Thu, 08 Sep 2005 14:01:25 +0200 Subject: Reading state of a contactgroup? In-Reply-To: References: <4e0e33ee050908013254e776c0@mail.gmail.com> <431FFE8C.4040402@op5.se> Message-ID: <43202815.4070900@op5.se> Morten Guldager wrote: > On 9/8/05, Andreas Ericsson wrote: > >>Morten Guldager wrote: >> >>>On 9/8/05, S?bastien Barbereau wrote: >>> >>> >>>>I think you should have a look at the "status_file" variable from your main >>>>configuration file ... >>> >>> >>>As far as I can tell the status_file only contains information on >>>hosts and services. I need something "collected". >>>My 3 contactgroup's contains multiple objects each, I need only one >>>traffic light for each contactgroup, telling something about the >>>"worst" condition in that particular group. >>> >> >>Assuming the contacts in those different contactgroups are only contacts >>for the hosts they manage, the current Nagios gui will do this for you >>(but in greater detail) on the tactical overview, so why fiddle with >>extra scripts? > > > My "traffic lights" are not webbased. We are talking real analog hardware here. > I know how to manipulate that part, I just cant figure out where to > get the information from nagios. > > My current "best idea", (which sucks!), is to make a contact with a > notification which logs events to a file. Then a daemon listens to > this file and inserts changes to a database. "OK" states removes from > the database. > Then its a simple select to figure out the collective status for a contactgroup. > This is not a bad idea at all, but why use a daemon to listen to a file when it can just as well listen to a socket or a pipe? Have the daemon either run some program to set the lights or do it itself. Borrowing the code from Nagios to parse the objects.cache file should be a no-brainer. Or better yet, write a NEB-module to handle it. That way you get access to the Nagios configuration when it's already parsed and all should be well. -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Lead Developer ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From sudheer at tgs-solutions.com Thu Sep 8 14:32:12 2005 From: sudheer at tgs-solutions.com (Sudheer Muddappa) Date: Thu, 08 Sep 2005 08:32:12 -0400 Subject: check_nt_cpuload help Message-ID: <43202F4C.10805@tgs-solutions.com> Hi guys, In my services.cfg file I have for check_nt_cpuload as define service{ host_name stealth service_description CPU LOAD is_volatile 0 check_period 24x7 max_check_attempts 3 normal_check_interval 5 retry_check_interval 1 contact_groups server-admins notification_interval 240 notification_period 24x7 notification_options u,c,r check_command check_nt_cpuload!60!80%!90%!90!80%!90! } Is this is correct? Just want to check the cpu load every 60 minutes. On the web page I see a message saying not enough values for -l parameters check_disk is workgin properly for the same server. Thanks, -- Sudheer Muddappa ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Leonard.Miller at baesystems.com Thu Sep 8 14:44:17 2005 From: Leonard.Miller at baesystems.com (Leonard Miller) Date: Thu, 08 Sep 2005 07:44:17 -0500 Subject: Status Map Woes Message-ID: Hi, I've recently seen problems with status map, now it's my turn. I can see the map, but all my hosts are in a circle with the ? image representing each host. I created a hostextinfo.cfg file and added cfg_file=/usr/local/nagios/etc/hostextinfo.cfg to my nagios.cfg file. I tried changing the coords for my host several times, but it just won't budge. So far this is all I have in my hostextinfo.cfg file: define hostextinfo{ host_name HOSTNAME notes Wireless WDS 2d_coords 300,550 3d_coords 100.0,50.0,75.0 } Running Nagios 2.0b4 What I am trying to do is move because I want set my wireless radios in a tree under the WDS. Thanks in advance Leonard ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From dshurett at alphaomegahosting.com Thu Sep 8 15:11:55 2005 From: dshurett at alphaomegahosting.com (Danny Shurett) Date: Thu, 08 Sep 2005 09:11:55 -0400 Subject: Problem with check_load on centos4 Message-ID: ./check_load 5 10 15 10 15 20 CRITICAL - load average: 0.55, 0.30, 0.11|load1=0.550;0.000;0.000;0; load5=0.300;0.000;0.000;0; load15=0.110;0.000;0.000;0; As you can see from the check command above, the server load is way below the levels indicated in the check command but are still coming up as critical. Any suggestions? ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From dhopp at GOCSC.com Thu Sep 8 15:15:47 2005 From: dhopp at GOCSC.com (Dennis Hopp) Date: Thu, 8 Sep 2005 08:15:47 -0500 Subject: Unknown status, alerts over and over Message-ID: <395D09D0C949AF40B9DDFEE9C93A47FF04799CB1@hudson.gocsc.com> I've had a few servers where the snmp service on them will crash and so the nagios plugins cannot get the status of different services. Nagios will send an alert that the service is "UNKNOWN" which is what I want, but then it will send the alert over and over and over and over and over until the snmp service is fixed. For example I got 75+ messages overnight about one servers disk was unknown (the server was fine, just snmp wasn't responding). How can I get nagios to send the alert for "UNKNOWN" once and then send a recovery message when the problem is fixed? An example service definition is below. The no-warning-service just doesn't send a warning alert (but the web interface will show warning status). define service{ use no-warning-service hosts lewis service_description Disk-D is_volatile 0 check_period 24x7 max_check_attempts 5 normal_check_interval 30 retry_check_interval 5 contact_groups domain-admins check_command check_windows_disk!D!85!94 } Thanks, --Dennis * "Privileged/Confidential Information of Communications Supply Corp. may be contained in this message. If you are not the addressee of this message, you may not copy, use or deliver this message to anyone. In such event, you should destroy the message and kindly notify the sender by reply e-mail. It is understood that opinions or conclusions that do not relate to the official business of Communications Supply Corp. are neither given nor endorsed by Communications Supply Corp." ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From chris at aidworld.org Thu Sep 8 16:54:06 2005 From: chris at aidworld.org (Chris Wilson) Date: Thu, 08 Sep 2005 15:54:06 +0100 Subject: Problem with check_load on centos4 In-Reply-To: References: Message-ID: <1126191246.28001.287.camel@localhost> Hi Danny, > ./check_load 5 10 15 10 15 20 > CRITICAL - load average: 0.55, 0.30, 0.11|load1=0.550;0.000;0.000;0; > load5=0.300;0.000;0.000;0; load15=0.110;0.000;0.000;0; > As you can see from the check command above, the server load is way below > the levels indicated in the check command but are still coming up as > critical. Any suggestions? Are you sure that's the right way to use check_load? When I try that, I get: > [chris at dev anthill]$ /usr/local/nagios/libexec/check_load 5 10 15 10 > 15 20 > Parameter inconsistency: 5-minute "warning load" greater than > "critical load". > Usage: check_load -w WLOAD1,WLOAD5,WLOAD15 -c CLOAD1,CLOAD5,CLOAD15 > check_load --version > check_load --help > But this works: > [chris at dev anthill]$ /usr/local/nagios/libexec/check_load -w 5,10,15 > -c 10,15,20 > OK - load average: 1.20, 0.92, 0.63 Cheers, Chris. -- (aidworld) chris wilson | chief engineer (chris at aidworld.org) ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Thu Sep 8 17:25:49 2005 From: marc at ena.com (Marc Powell) Date: Thu, 8 Sep 2005 10:25:49 -0500 Subject: Unknown status, alerts over and over Message-ID: > -----Original Message----- > From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users- > admin at lists.sourceforge.net] On Behalf Of Dennis Hopp > Sent: Thursday, September 08, 2005 8:16 AM > To: Nagios-users at lists.sourceforge.net > Subject: [Nagios-users] Unknown status, alerts over and over > > I've had a few servers where the snmp service on them will crash and so > the nagios plugins cannot get the status of different services. Nagios > will send an alert that the service is "UNKNOWN" which is what I want, > but then it will send the alert over and over and over and over and over > until the snmp service is fixed. For example I got 75+ messages > overnight about one servers disk was unknown (the server was fine, just > snmp wasn't responding). > > How can I get nagios to send the alert for "UNKNOWN" once and then send > a recovery message when the problem is fixed? http://nagios.sourceforge.net/docs/1_0/xodtemplate.html#service notification_interval: This directive is used to define the number of "time units" to wait before re-notifying a contact that this service is still in a non-OK state. Unless you've changed the interval_length directive from the default value of 60, this number will mean minutes. If you set this value to 0, Nagios will not re-notify contacts about problems for this service - only one problem notification will be sent out, unless there has been a state change. > * > "Privileged/Confidential Information of Communications Supply > Corp. may be contained in this message. If you are not the addressee of How do I know if this is privileged or confidential? > this message, you may not copy, use or deliver this message to anyone. I wasn't the addressee of the message. I guess I shouldn't respond. > In such event, you should destroy the message and kindly notify the Burning the message now. There are of course other permanently and publicly archived copies of this e-mail that will remain available as long as the Internet exists but I guess that's ok. > sender by reply e-mail. It is understood that opinions or conclusions > that do not relate to the official business of Communications Supply > Corp. are neither given nor endorsed by Communications Supply Corp." Is this official business? How am I supposed to know? -- Marc ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From blakekrone at gmail.com Thu Sep 8 17:54:44 2005 From: blakekrone at gmail.com (Blake Krone) Date: Thu, 8 Sep 2005 09:54:44 -0600 Subject: Setting env variables from misc commands (email from field) In-Reply-To: References: Message-ID: I should have stated that I'm using nullmailer, don't need a full featured SMTP, I just relay off of our exchange server. On 9/2/05, Subhendu Ghosh wrote: > > On Fri, 2 Sep 2005, Blake Krone wrote: > > > Hello all, I'm trying to set it so that when using nullmailer the From > field > > will be "alerts" instead of root as it is currently. I tried to set it > by > > doing: > > export USER=alerts;/usr/bin/printf "%s" ...... etc > > but nagios won't send out alerts when I have it set like that. > > > > How can I set the from field?? > > > > Thanks! > > Blake > > > > > sendmail genericstable > > http://www.linux.com/howtos/Sendmail-Address-Rewrite.shtml > > -- > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From chris at aidworld.org Thu Sep 8 18:06:05 2005 From: chris at aidworld.org (Chris Wilson) Date: Thu, 08 Sep 2005 17:06:05 +0100 Subject: Setting env variables from misc commands (email from field) In-Reply-To: References: Message-ID: <1126195564.31899.38.camel@localhost> Hi Blake, > I should have stated that I'm using nullmailer, don't need a full > featured SMTP, I just relay off of our exchange server This is really a nullmailer question, not Nagios. Changing the address should be done in nullmailer, and maybe it ignores the USER environment variable. Check the nullmailer docs for how to do this. If it doesn't support it, you could try another mailer (mailx maybe). Cheers, Chris. -- (aidworld) chris wilson | chief engineer (chris at aidworld.org) ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ecleofe7 at yahoo.com Thu Sep 8 18:17:28 2005 From: ecleofe7 at yahoo.com (Eduard B. Cleofe) Date: Thu, 8 Sep 2005 09:17:28 -0700 (PDT) Subject: Can nagios affect network performance? Message-ID: <20050908161728.67470.qmail@web50709.mail.yahoo.com> Hi Guys, Id like to know if nagios can degrade the network performance,especially when it comes to bandwith and process of servers. Because im planning to setup a nagios network monitoring in a separate pc for our servers and terminals of our client.we are an isp btw. But our nms has monitoring already but it is not good as nagios i think on which you can do a lot of things on it. The services to be running will check only if the servers is alive,temperature,disk usage and some other things i could monitor to maintain our network.Plus the 1000 terminals on which to be monitor by ping and http only. Hope for your answer guys. Thank you. eduard eng'r.eduard __________________________________________________ Do You Yahoo!? Tired of spam? Yahoo! Mail has the best spam protection around http://mail.yahoo.com -------------- next part -------------- An HTML attachment was scrubbed... URL: From rossz at vamos-wentworth.org Thu Sep 8 18:25:30 2005 From: rossz at vamos-wentworth.org (Rossz Vamos-Wentworth) Date: Thu, 08 Sep 2005 09:25:30 -0700 Subject: checking process time Message-ID: <432065FA.90700@vamos-wentworth.org> I have a perl script used as a pipe for email that does some special processing of data. Occassionally, unfortunately, it gets "stuck" and does not terminate. When this happens, it ends up using most of the CPU and pretty much screws up the system. Until I can track down what is causing the infinite loop I was wondering if there was a way to check the life of a process of a specific name and execute an event handler if it's been running too long. The script should only take a few seconds to run, so I figure if it is more than a few minutes old I can simply have nagios kill the problem process (e.g. (kill -9 pid" should do the job). There is the problem if necessary privileges, too, but that's easy enough to overcome. -- Rossz ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From tjn at umn.edu Thu Sep 8 18:28:30 2005 From: tjn at umn.edu (Travis J. Noll) Date: Thu, 08 Sep 2005 11:28:30 -0500 Subject: partial service_perfdata_command failure In-Reply-To: <431F3177.9070203@umn.edu> References: <431F3177.9070203@umn.edu> Message-ID: <432066AE.6010406@umn.edu> Nagios Know-it-alls, Is there some very basic configuration I'm overlooking to enable service_perfdata_command on a host by host or service by service basis? When I look in the Nagios gui, the View Config->Services->Process Performance Data column is Yes across the board, yet when I enable the stock service_perfdata_command=process-service-perfdata line I get only a subset of the services that are actually being monitored. I should mention at least this basic configuration information, and I can happily provide more as needed. Nagios 1.2 Linux myhost 2.4.21-32.0.1.ELsmp #1 SMP Tue May 17 17:52:23 EDT 2005 i686 i686 i386 GNU/Linu Much thanks, -Travis Travis J. Noll wrote: > Thanks in advance for any thoughts on this matter. > > I've been handed the keys to a generally neglected nagios install and > been given marching orders to get load / mem usage graphs up and > running. I found a partially configured nagiostat in our code tree and > have gotten it to work on some occasions. > > My problem is that the service_perfdata_command does not seem to be > running for all services. I have the lines: > > # nagiostat > service_perfdata_command=service-perf-data-handler > > but have also reverted to uncommenting: > > # testing > #service_perfdata_command=process-service-perfdata > > where I was able to see both in > > /usr/local/nagios/nagiostat/debug.log > > and > > /tmp/service-perfdata > > that not all of the service updates available via the web interface are > being "post-processed" > > One host is handing off disk usage information to the > service_perfdata_command but not load or memory services. Other hosts > are able to pipe load and memory usage information fine. Another host > appears to have no information available in either of the expected log > files, though it stays current within the web interface to nagios. > > Sorry to be such a feeb, but I've only been working on this a couple > days and have been unable to find an answer in docs, google, or > archives. Please let me know what configuration information I can dig > up to be helpful. > > Thanks again, > Travis ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From dist-list at LEXUM.UMontreal.CA Thu Sep 8 19:19:14 2005 From: dist-list at LEXUM.UMontreal.CA (FM) Date: Thu, 08 Sep 2005 13:19:14 -0400 Subject: check_nrpe missing from Dag RPMS ? Message-ID: <43207292.5060709@lexum.umontreal.ca> Hello , I installed RPMS: nagios-plugins-1.4-2 nagios-2.0-0.b4.1 From dag.wieers RPMS repository. But I cannot find check_nrpe During RPM creation (using src RPMS) do I have to add a config parameter to have check_nrpe ? thanks ! ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at gridshield.net Thu Sep 8 19:28:09 2005 From: marc at gridshield.net (Marc DeTrano) Date: Thu, 08 Sep 2005 11:28:09 -0600 Subject: circular ballon status map quesition In-Reply-To: <4315CD63.3000203@gridshield.net> References: <4315CD63.3000203@gridshield.net> Message-ID: <432074A9.3000206@gridshield.net> Aha, think I finally got it -- the fill colors are related to state duration. If a host has been in a certain state for more than an hour, the fill goes away. That is a very nice feature, actually. Marc Marc DeTrano wrote: > Thanks for the help--still working on the fill colors. > > I thought it might have something to do with indicating problem states > in the services, but when I checked it over that did not add up (some > hosts with service problems would still have no fills). > > Sometimes the color is yellow, sometime orange, think I might have > seen gray once. > > > On a related note, anyone know of any alternative status map cgi's > (checked Nagios Exchange and googled, did not find much)--perhaps > there is something in the works? This cgi was never a big deal to me > personally except to check outages, but potential clients always point > at the maps in (insert $6-figure proprietary solution here) and say > "wow, cool map" (how's that for a design spec---"make it cooler"). I > am mulling over starting a project myself, but it seems a little > daunting. > > Marc > > > > Andrew Cruse wrote: > >> nagios-users-admin at lists.sourceforge.net wrote: >> >> >>> Don't know what determines the size of the circles but note >>> that things like LAN/WAN interfaces are small and devices >>> hanging off the LAN segments like servers, workstations and >>> printers are bigger on my map. >>> >> >> >> The size of the circle corresponds to the relative number of services >> associated with the host. >> >> Andrew >> >> >> > > > > ------------------------------------------------------- > SF.Net email is Sponsored by the Better Software Conference & EXPO > September 19-22, 2005 * San Francisco, CA * Development Lifecycle > Practices > Agile & Plan-Driven Development * Managing Projects & Teams * Testing > & QA > Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. ::: Messages without supporting info will risk > being sent to /dev/null ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From pete at stuff-done.co.uk Thu Sep 8 19:31:18 2005 From: pete at stuff-done.co.uk (Pete Dewell) Date: Thu, 08 Sep 2005 18:31:18 +0100 Subject: check_nrpe missing from Dag RPMS ? In-Reply-To: <43207292.5060709@lexum.umontreal.ca> References: <43207292.5060709@lexum.umontreal.ca> Message-ID: <43207566.2070601@stuff-done.co.uk> Try http://dag.wieers.com/packages/nagios-nrpe/ nrpe is a separate package to both Nagios and the Nagios plugins. Pete Dewell FM wrote: > Hello , > I installed RPMS: > nagios-plugins-1.4-2 > nagios-2.0-0.b4.1 > From dag.wieers RPMS repository. > > But I cannot find check_nrpe > > During RPM creation (using src RPMS) do I have to add a config parameter > to have check_nrpe ? > > thanks ! > > > > ------------------------------------------------------- > SF.Net email is Sponsored by the Better Software Conference & EXPO > September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices > Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA > Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. ::: Messages without supporting info will risk > being sent to /dev/null -- Pete Dewell | Stuff Done pete at stuff-done.co.uk ** The information contained in this message, including any attachment, is confidential and may be privileged or otherwise protected from disclosure. The information is intended only for the person or entity to which it is addressed. If you are not the intended recipient, please contact the sender and delete this message from your system. Any review, re-transmission, distribution or other use of this information by persons or entities other than the intended recipient is prohibited. * ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From dist-list at LEXUM.UMontreal.CA Thu Sep 8 19:36:15 2005 From: dist-list at LEXUM.UMontreal.CA (FM) Date: Thu, 08 Sep 2005 13:36:15 -0400 Subject: check_nrpe missing from Dag RPMS ? In-Reply-To: <43207566.2070601@stuff-done.co.uk> References: <43207292.5060709@lexum.umontreal.ca> <43207566.2070601@stuff-done.co.uk> Message-ID: <4320768F.1030901@lexum.umontreal.ca> here is what provides nagios-nrpe : /etc/nagios /etc/nagios/nrpe.cfg /etc/rc.d/init.d/nrpe /etc/xinetd.d/nrpe /usr/sbin/nrpe /usr/share/doc/nagios-nrpe-2.0 /usr/share/doc/nagios-nrpe-2.0/Changelog /usr/share/doc/nagios-nrpe-2.0/LEGAL /usr/share/doc/nagios-nrpe-2.0/README no check_nrpe :-) Pete Dewell wrote: > Try http://dag.wieers.com/packages/nagios-nrpe/ > > nrpe is a separate package to both Nagios and the Nagios plugins. > > Pete Dewell > > FM wrote: > >> Hello , >> I installed RPMS: >> nagios-plugins-1.4-2 >> nagios-2.0-0.b4.1 >> From dag.wieers RPMS repository. >> >> But I cannot find check_nrpe >> >> During RPM creation (using src RPMS) do I have to add a config >> parameter to have check_nrpe ? >> >> thanks ! >> >> >> >> ------------------------------------------------------- >> SF.Net email is Sponsored by the Better Software Conference & EXPO >> September 19-22, 2005 * San Francisco, CA * Development Lifecycle >> Practices >> Agile & Plan-Driven Development * Managing Projects & Teams * Testing >> & QA >> Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf >> _______________________________________________ >> Nagios-users mailing list >> Nagios-users at lists.sourceforge.net >> https://lists.sourceforge.net/lists/listinfo/nagios-users >> ::: Please include Nagios version, plugin version (-v) and OS when >> reporting any issue. ::: Messages without supporting info will risk >> being sent to /dev/null > > -- Frederic Medery System Administrator LexUM, University of Montreal ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From pete at stuff-done.co.uk Thu Sep 8 19:55:32 2005 From: pete at stuff-done.co.uk (Pete Dewell) Date: Thu, 08 Sep 2005 18:55:32 +0100 Subject: check_nrpe missing from Dag RPMS ? In-Reply-To: <4320768F.1030901@lexum.umontreal.ca> References: <43207292.5060709@lexum.umontreal.ca> <43207566.2070601@stuff-done.co.uk> <4320768F.1030901@lexum.umontreal.ca> Message-ID: <43207B14.5040806@stuff-done.co.uk> My apologies, I didn't notice that it was just the daemon. In that case, I would suggest you get the source code and build from scratch. I have never had any trouble building nrpe, on any platform. Get src from http://www.nagiosexchange.org/NRPE.77.0.html Pete Dewell FM wrote: > here is what provides nagios-nrpe : > /etc/nagios > /etc/nagios/nrpe.cfg > /etc/rc.d/init.d/nrpe > /etc/xinetd.d/nrpe > /usr/sbin/nrpe > /usr/share/doc/nagios-nrpe-2.0 > /usr/share/doc/nagios-nrpe-2.0/Changelog > /usr/share/doc/nagios-nrpe-2.0/LEGAL > /usr/share/doc/nagios-nrpe-2.0/README > > no check_nrpe :-) > > > > Pete Dewell wrote: > >> Try http://dag.wieers.com/packages/nagios-nrpe/ >> >> nrpe is a separate package to both Nagios and the Nagios plugins. >> >> Pete Dewell >> >> FM wrote: >> >>> Hello , >>> I installed RPMS: >>> nagios-plugins-1.4-2 >>> nagios-2.0-0.b4.1 >>> From dag.wieers RPMS repository. >>> >>> But I cannot find check_nrpe >>> >>> During RPM creation (using src RPMS) do I have to add a config >>> parameter to have check_nrpe ? >>> >>> thanks ! >>> >>> >>> >>> ------------------------------------------------------- >>> SF.Net email is Sponsored by the Better Software Conference & EXPO >>> September 19-22, 2005 * San Francisco, CA * Development Lifecycle >>> Practices >>> Agile & Plan-Driven Development * Managing Projects & Teams * Testing >>> & QA >>> Security * Process Improvement & Measurement * >>> http://www.sqe.com/bsce5sf >>> _______________________________________________ >>> Nagios-users mailing list >>> Nagios-users at lists.sourceforge.net >>> https://lists.sourceforge.net/lists/listinfo/nagios-users >>> ::: Please include Nagios version, plugin version (-v) and OS when >>> reporting any issue. ::: Messages without supporting info will risk >>> being sent to /dev/null >> >> >> > -- Pete Dewell | Stuff Done pete at stuff-done.co.uk ** The information contained in this message, including any attachment, is confidential and may be privileged or otherwise protected from disclosure. The information is intended only for the person or entity to which it is addressed. If you are not the intended recipient, please contact the sender and delete this message from your system. Any review, re-transmission, distribution or other use of this information by persons or entities other than the intended recipient is prohibited. * ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ygonzales at medassets.com Thu Sep 8 20:17:33 2005 From: ygonzales at medassets.com (Gonzales, Youn) Date: Thu, 8 Sep 2005 13:17:33 -0500 Subject: PerfParse Message-ID: <99CF04974931C548B2BF20CD898FCBA7B7503F@uscpgmedexch01.medassets.com> I am running PerfParse v0.105.6 and Nagios 2.0b4. The performance graphs work on all of the services I have setup so far except check_ping. Is there an alternative to check_ping that will let us graph the ping times to different network devices or am I just missing something in the config? My generic-service config: define service{ name generic-service active_checks_enabled 1 passive_checks_enabled 1 parallelize_check 1 obsess_over_service 1 check_freshness 0 notifications_enabled 1 event_handler_enabled 1 flap_detection_enabled 1 failure_prediction_enabled 1 process_perf_data 1 retain_status_information 1 retain_nonstatus_information 1 register 0 } One of my host configs: define service{ use generic-service host_name uscpg1stwap1 service_description PING is_volatile 0 check_period 24x7 max_check_attempts 4 normal_check_interval 5 retry_check_interval 1 contact_groups admins notification_interval 960 notification_period 24x7 check_command check_ping!100.0,20%!500.0,60% } Youn Gonzales Network Engineer MedAssets Supply Chain Systems 280 S Mount Auburn Rd Cape Girardeau, MO 63703 (573) 332-2285 Phone (573) 332-2300 Fax ygonzales at medassets.com "The information transmitted is intended only for the person or entity to which it is addressed and may contain confidential, proprietary, and/or privileged material. Any review, retransmission, dissemination or other use of, or taking of any action in reliance upon this information by persons or entities other than the intended recipient is prohibited. If you received this in error, please contact the sender and delete the material from all computers" ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ae at op5.se Thu Sep 8 20:20:46 2005 From: ae at op5.se (Andreas Ericsson) Date: Thu, 08 Sep 2005 20:20:46 +0200 Subject: PerfParse In-Reply-To: <99CF04974931C548B2BF20CD898FCBA7B7503F@uscpgmedexch01.medassets.com> References: <99CF04974931C548B2BF20CD898FCBA7B7503F@uscpgmedexch01.medassets.com> Message-ID: <432080FE.1070504@op5.se> Gonzales, Youn wrote: > I am running PerfParse v0.105.6 and Nagios 2.0b4. The performance graphs > work on all of the services I have setup so far except check_ping. Is > there an alternative to check_ping that will let us graph the ping times Yes. check_icmp. http://oss.op5.se/nagios Have fun. -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Lead Developer ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Thu Sep 8 20:25:35 2005 From: marc at ena.com (Marc Powell) Date: Thu, 8 Sep 2005 13:25:35 -0500 Subject: checking process time Message-ID: > -----Original Message----- > From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users- > admin at lists.sourceforge.net] On Behalf Of Rossz Vamos-Wentworth > Sent: Thursday, September 08, 2005 11:26 AM > To: nagios-users at lists.sourceforge.net > Subject: [Nagios-users] checking process time > > I have a perl script used as a pipe for email that does some special > processing of data. Occassionally, unfortunately, it gets "stuck" and > does not terminate. When this happens, it ends up using most of the CPU > and pretty much screws up the system. Until I can track down what is > causing the infinite loop I was wondering if there was a way to check > the life of a process of a specific name and execute an event handler if > it's been running too long. The script should only take a few seconds > to run, so I figure if it is more than a few minutes old I can simply > have nagios kill the problem process (e.g. (kill -9 pid" should do the > job). Nagios-plugins-1.4.1 check_procs *under linux* adds an additional metric called ELAPSED which appears to allow for checking how long a process has been running. I've tried testing it but the call to ps isn't including the 'etime' option ala "/bin/ps -axwo 'stat uid ppid vsz rss pcpu comm args etime'" so it isn't working properly. It looks to me like configure tests less informative variations of the ps command first and if one of those matches it will use that for the ps format instead of progressing to more informative variations, including the one that has etime. From configure.log -- configure:14078: result: /bin/ps configure:14086: checking for ps syntax configure:14095: result: /bin/ps axwo 'stat uid pid ppid vsz rss pcpu comm args' when in fact, the one that includes etime works correctly (taken from configure) -- $ ps -weo 'stat comm vsz rss user uid pid ppid etime args' STAT COMMAND VSZ RSS USER UID PID PPID ELAPSED COMMAND S init 1376 368 root 0 1 0 132-06:03:18 init SW keventd 0 0 root 0 2 1 132-06:03:17 [keventd] SWN ksoftirqd_CPU0 0 0 root 0 3 1 132-06:03:17 [ksoftirqd_CPU0] Can anyone else confirm this as a bug? I don't see anything in the tracker. -- Marc ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ae at op5.se Thu Sep 8 20:36:20 2005 From: ae at op5.se (Andreas Ericsson) Date: Thu, 08 Sep 2005 20:36:20 +0200 Subject: checking process time In-Reply-To: References: Message-ID: <432084A4.2060109@op5.se> Marc Powell wrote: > >>-----Original Message----- >>From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users- >>admin at lists.sourceforge.net] On Behalf Of Rossz Vamos-Wentworth >>Sent: Thursday, September 08, 2005 11:26 AM >>To: nagios-users at lists.sourceforge.net >>Subject: [Nagios-users] checking process time >> >>I have a perl script used as a pipe for email that does some special >>processing of data. Occassionally, unfortunately, it gets "stuck" and >>does not terminate. When this happens, it ends up using most of the > > CPU > >>and pretty much screws up the system. Until I can track down what is >>causing the infinite loop I was wondering if there was a way to check >>the life of a process of a specific name and execute an event handler > > if > >>it's been running too long. The script should only take a few seconds >>to run, so I figure if it is more than a few minutes old I can simply >>have nagios kill the problem process (e.g. (kill -9 pid" should do the >>job). > > > Nagios-plugins-1.4.1 check_procs *under linux* adds an additional metric > called ELAPSED which appears to allow for checking how long a process > has been running. I've tried testing it but the call to ps isn't > including the 'etime' option ala "/bin/ps -axwo 'stat uid ppid vsz rss > pcpu comm args etime'" so it isn't working properly. It looks to me like > configure tests less informative variations of the ps command first and > if one of those matches it will use that for the ps format instead of > progressing to more informative variations, including the one that has > etime. From configure.log -- > > configure:14078: result: /bin/ps > configure:14086: checking for ps syntax > configure:14095: result: /bin/ps axwo 'stat uid pid ppid vsz rss pcpu > comm args' > > when in fact, the one that includes etime works correctly (taken from > configure) -- > > $ ps -weo 'stat comm vsz rss user uid pid ppid etime args' > STAT COMMAND VSZ RSS USER UID PID PPID ELAPSED > COMMAND > S init 1376 368 root 0 1 0 132-06:03:18 > init > SW keventd 0 0 root 0 2 1 132-06:03:17 > [keventd] > SWN ksoftirqd_CPU0 0 0 root 0 3 1 132-06:03:17 > [ksoftirqd_CPU0] > > Can anyone else confirm this as a bug? I don't see anything in the > tracker. > I have a vague memory of this being because some systems failed silently in the configure test, causing check_procs to sigsegv in whatever configuration it ran. I believe they were re-arranged rather than dropped so it would be easy to re-enable it later. cvs log should tell you. > -- > Marc > > > ------------------------------------------------------- > SF.Net email is Sponsored by the Better Software Conference & EXPO > September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices > Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA > Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Lead Developer ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Leonard.Miller at baesystems.com Thu Sep 8 20:40:21 2005 From: Leonard.Miller at baesystems.com (Leonard Miller) Date: Thu, 08 Sep 2005 13:40:21 -0500 Subject: Status Map Woes Message-ID: Me thinks me fingered it out. I missed the default_statusmap_layout option. DOH!!!! >>> "Leonard Miller" 9/8/2005 8:44:17 AM >>> Hi, I've recently seen problems with status map, now it's my turn. I can see the map, but all my hosts are in a circle with the ? image representing each host. I created a hostextinfo.cfg file and added cfg_file=/usr/local/nagios/etc/hostextinfo.cfg to my nagios.cfg file. I tried changing the coords for my host several times, but it just won't budge. So far this is all I have in my hostextinfo.cfg file: define hostextinfo{ host_name HOSTNAME notes Wireless WDS 2d_coords 300,550 3d_coords 100.0,50.0,75.0 } Running Nagios 2.0b4 What I am trying to do is move because I want set my wireless radios in a tree under the WDS. Thanks in advance Leonard ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mark.ahlstrom at managedmail.com Thu Sep 8 21:27:02 2005 From: mark.ahlstrom at managedmail.com (Mark Ahlstrom) Date: Thu, 08 Sep 2005 14:27:02 -0500 Subject: Event handler question Message-ID: <1126207622.8138.27.camel@mediis> I'm trying to put local event handlers in place for a couple of services. I created a test script so I could understand the interaction between nagios, the script and the events. The test script does nothing other than print the passed arguments to a file, in perl, "arg# :: arg-value". The script is supposed to execute when a state change occurs, and you can see the event handler logging in nagios.log. But the script is not executed. It's only executed when the service returns. Here's my log from one test (filling up the file system)... [1126205981] EXTERNAL COMMAND: SCHEDULE_FORCED_SVC_CHECK;smt;DISKS;1126205963 [1126206041] SERVICE ALERT: smt;DISKS;CRITICAL;SOFT;1;Critical: /(95%) [1126206041] SERVICE EVENT HANDLER: smt;DISKS;CRITICAL;SOFT;1;handler-service-test [1126206171] SERVICE ALERT: smt;DISKS;CRITICAL;SOFT;2;Critical: /(95%) [1126206171] SERVICE EVENT HANDLER: smt;DISKS;CRITICAL;SOFT;2;handler-service-test [1126206299] SERVICE ALERT: smt;DISKS;CRITICAL;HARD;3;Critical: /(95%) [1126206299] SERVICE NOTIFICATION: mark-mail;smt;DISKS;CRITICAL;notify-by-email;Critical: /(95%) [1126206300] SERVICE NOTIFICATION: mark;smt;DISKS;CRITICAL;notify-by-epager;Critical: /(95%) [1126206300] SERVICE NOTIFICATION: mark;smt;DISKS;CRITICAL;notify-by-email;Critical: /(95%) [1126206300] SERVICE EVENT HANDLER: smt;DISKS;CRITICAL;HARD;3;handler-service-test [1126206339] EXTERNAL COMMAND: SCHEDULE_FORCED_SVC_CHECK;smt;DISKS;1126206327 [1126206399] EXTERNAL COMMAND: ADD_SVC_COMMENT;smt;DISKS;1;nscmd;ACKNOWLEDGEMENT via SleepNscmd [1126206399] EXTERNAL COMMAND: ACKNOWLEDGE_SVC_PROBLEM;smt;DISKS;0;ACKNOWLEDGEMENT via SleepNscmd [1126206404] SERVICE ALERT: smt;DISKS;OK;HARD;3;All disks below warning/critical thresholds [1126206404] SERVICE NOTIFICATION: mark-mail;smt;DISKS;OK;notify-by-email;All disks below warning/critical thresholds [1126206404] SERVICE EVENT HANDLER: smt;DISKS;OK;HARD;3;handler-service-test Here's the script output from this one test. 0 :: OK 1 :: HARD 2 :: 3 3 :: smt 4 :: DISKS 5 :: 1126206404 6 :: All 7 :: disks 8 :: below 9 :: warning/critical 10 :: thresholds I included the $TIMET$ and $OUTPUT$ macros, and you can clearly see that the time correlates only to the hard recovery. Am I missing a configuration switch somewhere? I have event_handlers enabled in the nagios.cfg and in the services.cfg. Mark ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mark.ahlstrom at managedmail.com Thu Sep 8 22:30:56 2005 From: mark.ahlstrom at managedmail.com (Mark Ahlstrom) Date: Thu, 08 Sep 2005 15:30:56 -0500 Subject: Event handler question Message-ID: <1126211456.8138.40.camel@mediis> Wouldn't you guess that I would figure out why the event handler wasn't executing just after I posted. The handler will only execute upon a recovery when I have $OUTPUT$ defined in the command. Once I yank that out of the command definition, the handler executes as the documentation states. Mark ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Thu Sep 8 23:19:47 2005 From: marc at ena.com (Marc Powell) Date: Thu, 8 Sep 2005 16:19:47 -0500 Subject: Event handler question Message-ID: > -----Original Message----- > From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users- > admin at lists.sourceforge.net] On Behalf Of Mark Ahlstrom > Sent: Thursday, September 08, 2005 3:31 PM > To: nagios-users at lists.sourceforge.net > Subject: [Nagios-users] RE: Event handler question > > Wouldn't you guess that I would figure out why the event handler wasn't > executing just after I posted. > > The handler will only execute upon a recovery when I have $OUTPUT$ > defined in the command. Once I yank that out of the command definition, > the handler executes as the documentation states. I don't use event handlers but that seems awfully strange. Does the output include special characters like ', ", &, etc that aren't being quoted properly when in a non-OK state? -- Marc ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mark.ahlstrom at managedmail.com Fri Sep 9 00:40:23 2005 From: mark.ahlstrom at managedmail.com (Mark Ahlstrom) Date: Thu, 08 Sep 2005 17:40:23 -0500 Subject: event_handler oddity Message-ID: <1126219223.8138.50.camel@mediis> This is the result of a simple event handler I set up to test it's interoperability and interaction. The one I had a tough time getting to run this morning. This email went out when it reached a hard state change. The fun thing about this is that the service, DISKS, is in down time. A notification didn't go out but the script did execute! Does anyone know if it's supposed to happen this way? -- Mark -------- Forwarded Message -------- From: Nagios User To: ahlstrom Subject: event_handler Date: Thu, 8 Sep 2005 17:21:01 -0500 (CDT) 1126218061 :: DISKS on smt has reached attempt number 3 for the HARD state ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From tdondich at gmail.com Fri Sep 9 01:27:16 2005 From: tdondich at gmail.com (Taylor Dondich) Date: Thu, 8 Sep 2005 16:27:16 -0700 Subject: Fruity 1.0 Beta 2 is released! Message-ID: <3d2eb72a05090816274cbee2f2@mail.gmail.com> Your favorite Nagios Configuration Tool has been updated! http://fruity.sf.net Beta 2 is released! This release fixes a great deal of bugs found in Beta 1 and attempts to resolve all Nagios 2.x functionality. The DB Schema has changed, so please be sure to update your SQL schema file. Note: The database schema has changed, update your sql schema with the clean SQL file located in sqldata. CHANGELOG SINCE BETA1-PL3: - Host Check Command Parameters - Added Search Capability - Increased Side Navigation Functionality - Numerous Bug Fixes - New Open Source Developer: Craig A. Hancock - DB Schema Change (Please update your DB Schema) If you have any questions, problems, or bugs, please refer to the Sourceforge project page. http://fruity.sf.net ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From sampinar at gmail.com Fri Sep 9 06:01:39 2005 From: sampinar at gmail.com (Sam Pinar) Date: Fri, 9 Sep 2005 14:01:39 +1000 Subject: Bluetooth ?? Message-ID: <4aa215380509082101253ef89@mail.gmail.com> Hi all, Has anyone tried to send email alerts via bluetooth? I know its sounds abit stupid but could be useful in some aspects. Anyone with suggestions, or anyone who has it running or wants to get it running, let me know. Cheers, Sam -------------- next part -------------- An HTML attachment was scrubbed... URL: From enst at rao.elektra.ru Fri Sep 9 06:09:58 2005 From: enst at rao.elektra.ru (Evgeny Stepanov) Date: Fri, 9 Sep 2005 08:09:58 +0400 Subject: Remote monitoring (distributed) Message-ID: <1856173825.20050909080958@rao.elektra.ru> Hello everyone! I'm Nagios newbie and have some doubts. What am i doing? I have nagios host that is doing active checks for the hosts it can reach and is doing some checks via nsclient++ on remote hosts for the hosts it can't reach directly. so, i made the config like that: define host{ use generic-host host_name my-distant-host alias Distant host @ Khabarovsk # address 10.216.4.42 check_command check-dist-host!10.216.4.42!50,100!1500,2000 check_interval 3 max_check_attempts 10 notification_interval 120 notification_period 24x7 notification_options d,u,r contact_groups admins } The command check-dist-host looks like define command{ command_name check-dist-host command_line $USER1$/check_nrpe2 -H 192.168.11.8 -c check_dist_host_alive!$ARG1$!$ARG2$!$ARG3$ } 192.168.11.8 is the host, that is actually performing ping with the perl check_ping.pl script. I commented out the address field because i used to get a lot of flapping, because (as far as think) nagios checks whether host is alive in some extra way other than check_command. With address field commented it works well. but i think it's not the _right_ way of doing things in nagios. Any ideas? How do you resolve these situations of monitoring different network resources across the network?. I thought of distributed monitoring, but unfortunetly i don't have any *nix hosts in 192.168.0.0 network to put nagios on it. And i don't want to make any holes on my firewalls and routers to route all these dummy networks... Any help would be great! Best regards, Evgeny Stepanov ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From VINAY_SHARMA at advanex.co.jp Fri Sep 9 06:09:57 2005 From: VINAY_SHARMA at advanex.co.jp (VINAY_SHARMA at advanex.co.jp) Date: Fri, 9 Sep 2005 13:09:57 +0900 Subject: Bluetooth ?? Message-ID: Dear All, Is anyone has configured monitoring disk and cpu laod with nagios. i have configurd nagios 1st time and its running very good with alert and only ping and smtp check command.i want to monitor my some windows serevrs with cpu load and disk space. will appriciating if some one tell me configuration with current instllation of nagios. Thanks & regards ************************************** Vinay Sharma Information Systems and Solutions-Associates Advanex Inc (www.advanex.co.jp) Phone : 813-3822-5863 Fax : 813-5815-7881 Email : vinay_sharma at advanex.co.jp Sam Pinar ??: Nagios User Forums ???: cc: nagios-users-admin at lists.sour ??: [Nagios-users] Bluetooth ?? ceforge.net 2005/09/09 13:01 sampinar ????????? Hi all, Has anyone tried to send email alerts via bluetooth? I know its sounds abit stupid but could be useful in some aspects. Anyone with suggestions, or anyone who has it running or wants to get it running, let me know. Cheers, Sam ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Lui.external at infineon.com Fri Sep 9 07:21:04 2005 From: Lui.external at infineon.com (Lui.external at infineon.com) Date: Fri, 9 Sep 2005 13:21:04 +0800 Subject: Help: Nagios Core Program problem Message-ID: > Hi, > I am trying to install the nagios core program in my linux server. > I had already downloaded the nagios-1.2 and having glib-1.2 installed > in my server. Here are the steps that I had performed: > # cd nagios-1.2 > # ./configure --prefix=/home/samadm/nagios --with-nagios-user=samadm > --with-nagios-grp=`id -gn` --with-default-extinfo > --with-default-objects --with-ping-command='/bin/ping -w 56 -U -n -c > %d %s' > # make all > > I get an error as shown below when I am trying to execute the "make > all" command. > edata.o(.text+0x28): In function `read_extended_object_config_data': > /home/samadm/src/nagios-1.2/cgi/edata.c:85: undefined reference to > `xedtemplate_read_extended_object_config_data' > collect2: ld returned 1 exit status > make[1]: *** [extinfo.cgi] Error 1 > make[1]: Leaving directory `/home/samadm/src/nagios-1.2/cgi' > make: *** [all] Error 2 > May I know what is the problem behind? > When I opened the nagios website(http://172.17.22.22:8888/nagios/), I > get the error: 404 not found > > Is there anyone can help to solve this problem? > > Appreciate your help. > > Regards, > Louise ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From amontibello at gmail.com Fri Sep 9 07:52:23 2005 From: amontibello at gmail.com (Anthony Montibello) Date: Fri, 9 Sep 2005 01:52:23 -0400 Subject: check_nt_cpuload help In-Reply-To: <43202F4C.10805@tgs-solutions.com> References: <43202F4C.10805@tgs-solutions.com> Message-ID: For Checking the Average CPU Load over the past Hour use the following: check_command check_nt_cpuload!60,80,90,90,80,90 your command definition should be something like this: define command{ command_name check_nt_cpuload command_line $USER1$/check_nt -H $HOSTADDRESS$ -t 5 -p 1248 -v CPULOAD -l $ARG1$ } The way this command works is the -l is in sets of three (triplet) separated by a comma the first is the # of minutes to check, then the warning percent, last the critical percent value do not use % in the command, Ns_client and NC_Net can check more than one average at a time so the above command will check the load over the last hour and the last 1.5 hours. NS_Client and NC_net check the CPU Load several times a minute and then calculate the average when it receives the check_nt command. If using NC_Net make sure to increase the Parameter Cpu_max_interval to larger than the maximum minutes you will be checking. Hope this helps, Tony On 9/8/05, Sudheer Muddappa wrote: > > Hi guys, > > In my services.cfg file I have for check_nt_cpuload as > > define service{ > host_name stealth > service_description CPU LOAD > is_volatile 0 > check_period 24x7 > max_check_attempts 3 > normal_check_interval 5 > retry_check_interval 1 > contact_groups server-admins > notification_interval 240 > notification_period 24x7 > notification_options u,c,r > check_command > check_nt_cpuload!60!80%!90%!90!80%!90! > } > > Is this is correct? > > Just want to check the cpu load every 60 minutes. > > On the web page I see a message saying > > not enough values for -l parameters > > > check_disk is workgin properly for the same server. > > Thanks, > > -- > Sudheer Muddappa > > > > > > ------------------------------------------------------- > SF.Net email is Sponsored by the Better Software Conference & EXPO > September 19-22, 2005 * San Francisco, CA * Development Lifecycle > Practices > Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA > Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -------------- next part -------------- An HTML attachment was scrubbed... URL: From schoenfeld at in-medias-res.com Fri Sep 9 09:44:42 2005 From: schoenfeld at in-medias-res.com (=?ISO-8859-15?Q?sch=F6nfeld_/_in-medias-res?=) Date: Fri, 09 Sep 2005 09:44:42 +0200 Subject: Question about NRPE operation Message-ID: <43213D6A.5020209@in-medias-res.com> Hi, i'm having some problems with checking a ncpfs filesystem and got a suspicious on my mind, so i have a question about how the NRPE does operate. Ok, here we go: Nagios initiates a check of a particular service on Host X, so it does send a request to the NRPE daemon on Host X. Host X checks the request and starts the plugin which can do the requested service check and switches into wating state => waiting for the plugin answere. Now imagine that the ncpfs is busy, because of another "heavy operation" on it. So the plugin runs and runs, but has to wait for the filesystem and does not return a result to the nrpe in the meanwhile. So now the question is: Does the NRPE Server has an timeout after which it'll *kill* the plugin? If so: Linux ncpfs is not able of threading ncpfs operations. So if one process is accessing the ncpfs and gets a SIGKILL, the ncp connection becomes invalid and the source of my problem would be identified. Thanks in advance Greets Patrick ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ae at op5.se Fri Sep 9 10:41:13 2005 From: ae at op5.se (Andreas Ericsson) Date: Fri, 09 Sep 2005 10:41:13 +0200 Subject: Question about NRPE operation In-Reply-To: <43213D6A.5020209@in-medias-res.com> References: <43213D6A.5020209@in-medias-res.com> Message-ID: <43214AA9.9060404@op5.se> sch?nfeld / in-medias-res wrote: > Hi, > > i'm having some problems with checking a ncpfs filesystem and > got a suspicious on my mind, so i have a question about how > the NRPE does operate. > > Ok, here we go: > Nagios initiates a check of a particular service on Host X, > so it does send a request to the NRPE daemon on Host X. > > Host X checks the request and starts the plugin which can do the > requested service check and switches into wating state => waiting for > the plugin answere. > > Now imagine that the ncpfs is busy, because of another "heavy operation" > on it. So the plugin runs and runs, but has to wait for the filesystem > and does not return a result to the nrpe in the meanwhile. > The plugin itself is supposed to exit gracefully after some specified amount of maximum time. This is generally achieved by installing a signal-handler to catch SIGALRM and making an alarm(2) call. The signal-handler should make sure all locks and resources are released (the kernel will handle it otherwise, but that's considered terribly bad form). > So now the question is: Does the NRPE Server has an timeout after which > it'll *kill* the plugin? Yes, naturally. Otherwise it could risk filling up the process-table, or plugins with infinite loops could bring the entire system down. > If so: Linux ncpfs is not able of threading > ncpfs operations. So if one process is accessing the ncpfs and gets > a SIGKILL, the ncp connection becomes invalid and the source of my > problem would be identified. > This really can't be. Any locks and resources held by a terminated process should be cleared by the kernel (if not by the process itself). If they aren't, you've found a kernel bug. -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Lead Developer ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From schoenfeld at in-medias-res.com Fri Sep 9 11:07:02 2005 From: schoenfeld at in-medias-res.com (=?ISO-8859-15?Q?sch=F6nfeld_/_in-medias-res?=) Date: Fri, 09 Sep 2005 11:07:02 +0200 Subject: Question about NRPE operation In-Reply-To: <43214AA9.9060404@op5.se> References: <43213D6A.5020209@in-medias-res.com> <43214AA9.9060404@op5.se> Message-ID: <432150B6.1080705@in-medias-res.com> Hi, thanks for your answere. It helped me a lot. Andreas Ericsson schrieb: >> If so: Linux ncpfs is not able of threading >> ncpfs operations. So if one process is accessing the ncpfs and gets >> a SIGKILL, the ncp connection becomes invalid and the source of my >> problem would be identified. >> > > This really can't be. Any locks and resources held by a terminated > process should be cleared by the kernel (if not by the process itself). > If they aren't, you've found a kernel bug. Yes it is a bug in the ncpfs code of the kernel. Well not really a bug but an implementation weakness. The maintainer of the kernel ncpfs code So yes i could take it as a bug or just as an implementation weakness. Anyways would that identify my problems and make me really unhappy with that. stated the following in the kernel mailinglist a while ago: "You also must not send SIGKILL to processes which are in the middle of NCP transaction. Because of ncpfs does not use its own thread (or bh) to implement NCP ping-pong protocol, connection becomes invalid after such action, as ping-pong was not successfully completed." Well to know that doesn't make me happier but a little bit wiser. What could be a possible solution / workaround? Any ideas? Greets Patrick ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ae at op5.se Fri Sep 9 11:12:20 2005 From: ae at op5.se (Andreas Ericsson) Date: Fri, 09 Sep 2005 11:12:20 +0200 Subject: Question about NRPE operation In-Reply-To: <432150B6.1080705@in-medias-res.com> References: <43213D6A.5020209@in-medias-res.com> <43214AA9.9060404@op5.se> <432150B6.1080705@in-medias-res.com> Message-ID: <432151F4.8060408@op5.se> sch?nfeld / in-medias-res wrote: > Hi, > > thanks for your answere. It helped me a lot. > > Andreas Ericsson schrieb: > >>>If so: Linux ncpfs is not able of threading >>>ncpfs operations. So if one process is accessing the ncpfs and gets >>>a SIGKILL, the ncp connection becomes invalid and the source of my >>>problem would be identified. >>> >> >>This really can't be. Any locks and resources held by a terminated >>process should be cleared by the kernel (if not by the process itself). >>If they aren't, you've found a kernel bug. > > > Yes it is a bug in the ncpfs code of the kernel. Well not really a bug > but an implementation weakness. The maintainer of the kernel ncpfs code > So yes i could take it as a bug or just as an implementation weakness. > Anyways would that identify my problems and make me really unhappy with > that. > stated the following in the kernel mailinglist a while ago: > > "You also must not send SIGKILL to processes which are in the middle of > NCP transaction. Because of ncpfs does not use its own thread (or bh) > to implement NCP ping-pong protocol, connection becomes invalid after > such action, as ping-pong was not successfully completed." > > Well to know that doesn't make me happier but a little bit wiser. > > What could be a possible solution / workaround? Any ideas? > Make sure the program holding the lock exits gracefully by catching a SIGALRM rather than receiving the SIGKILL from nrpe. If that doesn't work, a kernel-patch is required. -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Lead Developer ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From agabellini at intelcom.sm Fri Sep 9 13:22:08 2005 From: agabellini at intelcom.sm (Andrea Gabellini) Date: Fri, 09 Sep 2005 13:22:08 +0200 Subject: Perfdata truncated Message-ID: <6.2.1.2.2.20050909124548.026d1c30@mail.intelcom.sm> Hi, I wrote a plugin that return many perfdata. The output looks like it: Sessions in: 27 - Sessions out: 82 | '195.219.218.138'=1 'EX1'=1 'EX1-h323'=29 'EX2'=3 'EX2-h323'=15 'Ibasis-SIP-1'=3 'Ibasis-SIP-2'=1 'ditutel1-h323'=1 'ditutel10-h323'=1 'ditutel11-h323'=2 'ditutel12-h323'=2 'ditutel15-h323'=1 'ditutel18-h323'=1 'ditutel19-h323'=1 'ditutel2-h323'=2 'ditutel20-h323'=1 'ditutel3-h323'=1 'ditutel4-h323'=3 'ditutel5-h323'=1 'ditutel6-h323'=1 'ditutel7-h323'=1 'ipcrossing-h323'=1 'netglobalis-h323'=2 'planetit2-h323'=3 'planetit3-h323'=4 'planetit4-h323'=3 'silvertech-h323'=18 'technosphere-h323'=2 'technosphere2-h323'=1 'tseyva-h323'=3 In the perfdata-service.log file the perfdata is truncated like it: 1126260797 localhost DETAIL Sessions in: 17 - Sessions out: 54 OK 'EX1'=1 'EX1-h323'=13 'EX2'=1 'EX2-h323'=19 'Ibasis-SIP-1'=1 'Ibasis-SIP-2'=1 'ditutel10-h323'=1 'ditutel12-h323'=1 'ditutel13-h323'=1 'ditutel18-h323'=3 'ditutel19-h323'=1 'ditutel2-h323'=1 'ditutel20-h323'=2 'ditutel4-h323'=1 'ditutel5-h323'=1 'ditutel7-h323'=1 'ditutel9-h323'=3 'netglobalis-h323'=1 'plane I looked in the source code (2b4) but I didn't find anything (I'm not a real C programmer :-( ) Can someone help me? Thanks in advance, Andrea --------------------------------------- A real friend is someone who trusts you with her secrets, warms you with her heart, and remembers you in her prayers. --------------------------------------- Ing. Andrea Gabellini Email: agabellini at intelcom.sm Tel: 0549 886111 (Italy) Tel. +378 0549 886111 (International) Intelcom San Marino S.p.A. Strada degli Angariari, 3 47891 Rovereta Repubblic of San Marino http://www.omniway.sm http://www.intelcom.sm ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From schoenfeld at in-medias-res.com Fri Sep 9 13:28:05 2005 From: schoenfeld at in-medias-res.com (=?ISO-8859-15?Q?sch=F6nfeld_/_in-medias-res?=) Date: Fri, 09 Sep 2005 13:28:05 +0200 Subject: Question about NRPE operation In-Reply-To: <432151F4.8060408@op5.se> References: <43213D6A.5020209@in-medias-res.com> <43214AA9.9060404@op5.se> <432150B6.1080705@in-medias-res.com> <432151F4.8060408@op5.se> Message-ID: <432171C5.2050300@in-medias-res.com> Andreas Ericsson schrieb: > Make sure the program holding the lock exits gracefully by catching a > SIGALRM rather than receiving the SIGKILL from nrpe. Well the problem is: i can't, because the plugin is a script that relies on reading a specific file which is located on the novell filesystem. Well - if the problem occurs (with ncpfs) than the command gets into an infintive wait state, until nrpe sends a SIGKILL. If my script handles SIGALRM all it can do is do send a KILL signal to the task which tries to read from the ncpfs and then i would have won nothing but a broken novell connection. Anyways: We found a solution. The plugin hangs if novell connection is busy because of a process which is running constant and reading from the novell filesystem regulary. We did renice this process and now we don't have the problem, where the plugin gets into an infitive wait state. Problem solved. Greets Patrick ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From sxanrr at yahoo.com Fri Sep 9 14:42:33 2005 From: sxanrr at yahoo.com (Sxan) Date: Fri, 9 Sep 2005 05:42:33 -0700 (PDT) Subject: Host Down alert when Host is Up Message-ID: <20050909124233.14319.qmail@web32201.mail.mud.yahoo.com> Hey everyone, We just changed the vlan that 2 servers are on in one location and all of a sudden Nagios is now reporting those servers as being down, even though I'm able to ping them fine from the Nagios server itself and everything else is functioning just fine. I'm confused as to why this would happen. Any ideas? I'm also wondering if this could be related to the TTL possibly changing? If so, does anyone know any simple way to change the ttl that Nagios looks for or maybe increase it somehow on the box? Thanks as always! ~Jim ______________________________________________________ Click here to donate to the Hurricane Katrina relief effort. http://store.yahoo.com/redcross-donate3/ ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From sghosh at sghosh.org Fri Sep 9 15:51:52 2005 From: sghosh at sghosh.org (Subhendu Ghosh) Date: Fri, 9 Sep 2005 09:51:52 -0400 (EDT) Subject: Host Down alert when Host is Up In-Reply-To: <20050909124233.14319.qmail@web32201.mail.mud.yahoo.com> References: <20050909124233.14319.qmail@web32201.mail.mud.yahoo.com> Message-ID: On Fri, 9 Sep 2005, Sxan wrote: > Hey everyone, > > We just changed the vlan that 2 servers are on in one > location and all of a sudden Nagios is now reporting > those servers as being down, even though I'm able to > ping them fine from the Nagios server itself and > everything else is functioning just fine. I'm confused > as to why this would happen. Any ideas? I'm also > wondering if this could be related to the TTL possibly > changing? If so, does anyone know any simple way to > change the ttl that Nagios looks for or maybe increase > it somehow on the box? Thanks as always! > > ~Jim > TTL doesn't matter. Did you change the IP addr of the servers in the nagios config? Are you use state retention? -- -sg ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From f1216 at yahoo.com Fri Sep 9 16:05:00 2005 From: f1216 at yahoo.com (Fred) Date: Fri, 9 Sep 2005 07:05:00 -0700 (PDT) Subject: Host Down alert when Host is Up In-Reply-To: References: Message-ID: <20050909140500.40012.qmail@web31907.mail.mud.yahoo.com> On my configuration, I ping some smart switches via a host-check. If the TTL is out of bounds according to the host-check command it returns a warning or critical. If it returns a critical the switch is reported as "down". -FredC --- Subhendu Ghosh wrote: > On Fri, 9 Sep 2005, Sxan wrote: > > > Hey everyone, > > > > We just changed the vlan that 2 servers are on in one > > location and all of a sudden Nagios is now reporting > > those servers as being down, even though I'm able to > > ping them fine from the Nagios server itself and > > everything else is functioning just fine. I'm confused > > as to why this would happen. Any ideas? I'm also > > wondering if this could be related to the TTL possibly > > changing? If so, does anyone know any simple way to > > change the ttl that Nagios looks for or maybe increase > > it somehow on the box? Thanks as always! > > > > ~Jim > > > > TTL doesn't matter. Did you change the IP addr of the servers in the > nagios config? Are you use state retention? > > > -- > -sg > > > ------------------------------------------------------- > SF.Net email is Sponsored by the Better Software Conference & EXPO > September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices > Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA > Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting > any issue. > ::: Messages without supporting info will risk being sent to /dev/null > ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From sghosh at sghosh.org Fri Sep 9 16:09:54 2005 From: sghosh at sghosh.org (Subhendu Ghosh) Date: Fri, 9 Sep 2005 10:09:54 -0400 (EDT) Subject: Host Down alert when Host is Up In-Reply-To: <20050909140500.40012.qmail@web31907.mail.mud.yahoo.com> References: <20050909140500.40012.qmail@web31907.mail.mud.yahoo.com> Message-ID: On Fri, 9 Sep 2005, Fred wrote: > On my configuration, I ping some smart switches via a host-check. > If the TTL is out of bounds according to the host-check command > it returns a warning or critical. If it returns a critical the > switch is reported as "down". > > -FredC check_ping only support RTA and packet loss. You must be using something else.. > > --- Subhendu Ghosh wrote: > >> On Fri, 9 Sep 2005, Sxan wrote: >> >>> Hey everyone, >>> >>> We just changed the vlan that 2 servers are on in one >>> location and all of a sudden Nagios is now reporting >>> those servers as being down, even though I'm able to >>> ping them fine from the Nagios server itself and >>> everything else is functioning just fine. I'm confused >>> as to why this would happen. Any ideas? I'm also >>> wondering if this could be related to the TTL possibly >>> changing? If so, does anyone know any simple way to >>> change the ttl that Nagios looks for or maybe increase >>> it somehow on the box? Thanks as always! >>> >>> ~Jim >>> >> >> TTL doesn't matter. Did you change the IP addr of the servers in the >> nagios config? Are you use state retention? >> >> >> -- >> -sg >> >> -- ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From f1216 at yahoo.com Fri Sep 9 16:20:15 2005 From: f1216 at yahoo.com (Fred) Date: Fri, 9 Sep 2005 07:20:15 -0700 (PDT) Subject: Host Down alert when Host is Up In-Reply-To: References: Message-ID: <20050909142015.92058.qmail@web31903.mail.mud.yahoo.com> Yes, you are correct, I misspoke (typed?), however, my point remains that if the host-check returns a critical, nagios will treat the node as down. Whatever the command you are using ... "down" is a state which is defined by the host-check, it doesn't always mean the host is down, maybe its just not behaving as expected. -FredC --- Subhendu Ghosh wrote: > On Fri, 9 Sep 2005, Fred wrote: > > > On my configuration, I ping some smart switches via a host-check. > > If the TTL is out of bounds according to the host-check command > > it returns a warning or critical. If it returns a critical the > > switch is reported as "down". > > > > -FredC > > check_ping only support RTA and packet loss. You must be using something > else.. > > > > > --- Subhendu Ghosh wrote: > > > >> On Fri, 9 Sep 2005, Sxan wrote: > >> > >>> Hey everyone, > >>> > >>> We just changed the vlan that 2 servers are on in one > >>> location and all of a sudden Nagios is now reporting > >>> those servers as being down, even though I'm able to > >>> ping them fine from the Nagios server itself and > >>> everything else is functioning just fine. I'm confused > >>> as to why this would happen. Any ideas? I'm also > >>> wondering if this could be related to the TTL possibly > >>> changing? If so, does anyone know any simple way to > >>> change the ttl that Nagios looks for or maybe increase > >>> it somehow on the box? Thanks as always! > >>> > >>> ~Jim > >>> > >> > >> TTL doesn't matter. Did you change the IP addr of the servers in the > >> nagios config? Are you use state retention? > >> > >> > >> -- > >> -sg > >> > >> > > -- > > > > ------------------------------------------------------- > SF.Net email is Sponsored by the Better Software Conference & EXPO > September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices > Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA > Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting > any issue. > ::: Messages without supporting info will risk being sent to /dev/null > ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From blakekrone at gmail.com Fri Sep 9 16:36:29 2005 From: blakekrone at gmail.com (Blake Krone) Date: Fri, 9 Sep 2005 08:36:29 -0600 Subject: Setting env variables from misc commands (email from field) In-Reply-To: <1126195564.31899.38.camel@localhost> References: <1126195564.31899.38.camel@localhost> Message-ID: I know nullmailer doesn't ignore the USER variable as I can do it from command line. Hence why I asked it as a nagios question, because it seems the misc commands ignores multiple commands seperated by ; or it just doesn't read from ENV variables. It's more of a Nagios question as you should be able to set the from field in nagios. On 9/8/05, Chris Wilson wrote: > > Hi Blake, > > > I should have stated that I'm using nullmailer, don't need a full > > featured SMTP, I just relay off of our exchange server > > This is really a nullmailer question, not Nagios. Changing the address > should be done in nullmailer, and maybe it ignores the USER environment > variable. Check the nullmailer docs for how to do this. If it doesn't > support it, you could try another mailer (mailx maybe). > > Cheers, Chris. > -- > (aidworld) chris wilson | chief engineer (chris at aidworld.org) > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From sxanrr at yahoo.com Fri Sep 9 16:51:48 2005 From: sxanrr at yahoo.com (Sxan) Date: Fri, 9 Sep 2005 07:51:48 -0700 (PDT) Subject: Host Down alert when Host is Up In-Reply-To: <20050909142015.92058.qmail@web31903.mail.mud.yahoo.com> References: <20050909142015.92058.qmail@web31903.mail.mud.yahoo.com> Message-ID: <20050909145148.25642.qmail@web32210.mail.mud.yahoo.com> The IP addresses did change, but I accounted for that in the hosts.cfg file for this host. I'm not an expert with Nagios so I'm not sure totally what you mean when you're asking if I use state retention. But basically the situation is that everything worked fine yesterday, IP addresses were changed and the 2 servers were put on a new vlan, but they are still up and completely reachable by ping and whatever else. That's why I'm kind of confused. Here's the nagios notification if it helps at all... ***** Nagios ***** Notification Type: PROBLEM Host: boca-srv01 State: DOWN Address: 10.168.0.201 Info: CRITICAL - Plugin timed out after 10 seconds Date/Time: Fri Sept 9 10:47:18 EDT 2005 ~Jim --- Fred wrote: > > Yes, you are correct, I misspoke (typed?), however, > my point > remains that if the host-check returns a critical, > nagios will > treat the node as down. Whatever the command you > are using ... > > "down" is a state which is defined by the > host-check, it doesn't > always mean the host is down, maybe its just not > behaving as > expected. > > -FredC > > --- Subhendu Ghosh wrote: > > > On Fri, 9 Sep 2005, Fred wrote: > > > > > On my configuration, I ping some smart switches > via a host-check. > > > If the TTL is out of bounds according to the > host-check command > > > it returns a warning or critical. If it returns > a critical the > > > switch is reported as "down". > > > > > > -FredC > > > > check_ping only support RTA and packet loss. You > must be using something > > else.. > > > > > > > > --- Subhendu Ghosh wrote: > > > > > >> On Fri, 9 Sep 2005, Sxan wrote: > > >> > > >>> Hey everyone, > > >>> > > >>> We just changed the vlan that 2 servers are on > in one > > >>> location and all of a sudden Nagios is now > reporting > > >>> those servers as being down, even though I'm > able to > > >>> ping them fine from the Nagios server itself > and > > >>> everything else is functioning just fine. I'm > confused > > >>> as to why this would happen. Any ideas? I'm > also > > >>> wondering if this could be related to the TTL > possibly > > >>> changing? If so, does anyone know any simple > way to > > >>> change the ttl that Nagios looks for or maybe > increase > > >>> it somehow on the box? Thanks as always! > > >>> > > >>> ~Jim > > >>> > > >> > > >> TTL doesn't matter. Did you change the IP addr > of the servers in the > > >> nagios config? Are you use state retention? > > >> > > >> > > >> -- > > >> -sg > > >> > > >> > > > > -- > > > > > > > > > ------------------------------------------------------- > > SF.Net email is Sponsored by the Better Software > Conference & EXPO > > September 19-22, 2005 * San Francisco, CA * > Development Lifecycle Practices > > Agile & Plan-Driven Development * Managing > Projects & Teams * Testing & QA > > Security * Process Improvement & Measurement * > http://www.sqe.com/bsce5sf > > _______________________________________________ > > Nagios-users mailing list > > Nagios-users at lists.sourceforge.net > > > https://lists.sourceforge.net/lists/listinfo/nagios-users > > ::: Please include Nagios version, plugin version > (-v) and OS when reporting > > any issue. > > ::: Messages without supporting info will risk > being sent to /dev/null > > > > > > > > __________________________________________________ Do You Yahoo!? Tired of spam? Yahoo! Mail has the best spam protection around http://mail.yahoo.com ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Fri Sep 9 16:59:06 2005 From: marc at ena.com (Marc Powell) Date: Fri, 9 Sep 2005 09:59:06 -0500 Subject: Perfdata truncated Message-ID: > -----Original Message----- > From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users- > admin at lists.sourceforge.net] On Behalf Of Andrea Gabellini > Sent: Friday, September 09, 2005 6:22 AM > To: nagios-users at lists.sourceforge.net > Subject: [Nagios-users] Perfdata truncated > > Hi, > > I wrote a plugin that return many perfdata. The output looks like it: > > Sessions in: 27 - Sessions out: 82 | '195.219.218.138'=1 'EX1'=1 > 'EX1-h323'=29 'EX2'=3 'EX2-h323'=15 'Ibasis-SIP-1'=3 'Ibasis-SIP-2'=1 > 'ditutel1-h323'=1 'ditutel10-h323'=1 'ditutel11-h323'=2 'ditutel12-h323'=2 > 'ditutel15-h323'=1 'ditutel18-h323'=1 'ditutel19-h323'=1 'ditutel2-h323'=2 > 'ditutel20-h323'=1 'ditutel3-h323'=1 'ditutel4-h323'=3 'ditutel5-h323'=1 > 'ditutel6-h323'=1 'ditutel7-h323'=1 'ipcrossing-h323'=1 > 'netglobalis-h323'=2 'planetit2-h323'=3 'planetit3-h323'=4 > 'planetit4-h323'=3 'silvertech-h323'=18 'technosphere-h323'=2 > 'technosphere2-h323'=1 'tseyva-h323'=3 > > In the perfdata-service.log file the perfdata is truncated like it: > > 1126260797 localhost DETAIL Sessions in: 17 - Sessions out: > 54 OK 'EX1'=1 'EX1-h323'=13 'EX2'=1 'EX2-h323'=19 > 'Ibasis-SIP-1'=1 'Ibasis-SIP-2'=1 'ditutel10-h323'=1 'ditutel12-h323'=1 > 'ditutel13-h323'=1 'ditutel18-h323'=3 'ditutel19-h323'=1 'ditutel2-h323'=1 > 'ditutel20-h323'=2 'ditutel4-h323'=1 'ditutel5-h323'=1 'ditutel7-h323'=1 > 'ditutel9-h323'=3 'netglobalis-h323'=1 'plane > > I looked in the source code (2b4) but I didn't find anything (I'm not a > real C programmer :-( ) > > Can someone help me? Looking at checks.c, this appears to be controlled by the variable MAX_PLUGIN_OUTPUT_LENGTH -- /* get performance data (if it exists) */ strncpy(temp_plugin_output,queued_svc_msg.output,sizeof(temp_plugin_outp ut)-1); temp_plugin_output[sizeof(temp_plugin_output)-1]='\x0'; temp_ptr=strtok(temp_plugin_output,"|\n"); temp_ptr=strtok(NULL,"\n"); if(temp_ptr!=NULL){ strip(temp_ptr); strncpy(temp_service->perf_data,temp_ptr,MAX_PLUGINOUTPUT_LENGTH-1); temp_service->perf_data[MAX_PLUGINOUTPUT_LENGTH-1]='\x0'; } Which is objects.h:#define MAX_PLUGINOUTPUT_LENGTH 352 /* max. length of plugin output */ You could try increasing that value but it may have undesired effects. If you're using NSCA for passive checks you'll need to increase MAX_INPUT_BUFFER (I believe) there as well. For passive checks, be sure the value you use is less than your OS's PIPE_BUF size. You may just want to re-write your plugin to return less data per check and check for subsets of the above information. HTH, -- Marc ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jjesse at iserv.net Fri Sep 9 17:01:13 2005 From: jjesse at iserv.net (jjesse at iserv.net) Date: Fri, 9 Sep 2005 11:01:13 -0400 Subject: Host Down alert when Host is Up In-Reply-To: <20050909145148.25642.qmail@web32210.mail.mud.yahoo.com> References: <20050909145148.25642.qmail@web32210.mail.mud.yahoo.com> Message-ID: <200509091101.13405.jjesse@iserv.net> I assume you reloaded Nagios after changing the file? On Friday 09 September 2005 10:51, Sxan wrote: > The IP addresses did change, but I accounted for that > in the hosts.cfg file for this host. I'm not an expert > with Nagios so I'm not sure totally what you mean when > you're asking if I use state retention. But basically > the situation is that everything worked fine > yesterday, IP addresses were changed and the 2 servers > were put on a new vlan, but they are still up and > completely reachable by ping and whatever else. That's > why I'm kind of confused. Here's the nagios > notification if it helps at all... > > ***** Nagios ***** > > Notification Type: PROBLEM > Host: boca-srv01 > State: DOWN > Address: 10.168.0.201 > Info: CRITICAL - Plugin timed out after 10 seconds > > Date/Time: Fri Sept 9 10:47:18 EDT 2005 > > > > > > ~Jim > > --- Fred wrote: > > Yes, you are correct, I misspoke (typed?), however, > > my point > > remains that if the host-check returns a critical, > > nagios will > > treat the node as down. Whatever the command you > > are using ... > > > > "down" is a state which is defined by the > > host-check, it doesn't > > always mean the host is down, maybe its just not > > behaving as > > expected. > > > > -FredC > > > > --- Subhendu Ghosh wrote: > > > On Fri, 9 Sep 2005, Fred wrote: > > > > On my configuration, I ping some smart switches > > > > via a host-check. > > > > > > If the TTL is out of bounds according to the > > > > host-check command > > > > > > it returns a warning or critical. If it returns > > > > a critical the > > > > > > switch is reported as "down". > > > > > > > > -FredC > > > > > > check_ping only support RTA and packet loss. You > > > > must be using something > > > > > else.. > > > > > > > --- Subhendu Ghosh wrote: > > > >> On Fri, 9 Sep 2005, Sxan wrote: > > > >>> Hey everyone, > > > >>> > > > >>> We just changed the vlan that 2 servers are on > > > > in one > > > > > >>> location and all of a sudden Nagios is now > > > > reporting > > > > > >>> those servers as being down, even though I'm > > > > able to > > > > > >>> ping them fine from the Nagios server itself > > > > and > > > > > >>> everything else is functioning just fine. I'm > > > > confused > > > > > >>> as to why this would happen. Any ideas? I'm > > > > also > > > > > >>> wondering if this could be related to the TTL > > > > possibly > > > > > >>> changing? If so, does anyone know any simple > > > > way to > > > > > >>> change the ttl that Nagios looks for or maybe > > > > increase > > > > > >>> it somehow on the box? Thanks as always! > > > >>> > > > >>> ~Jim > > > >> > > > >> TTL doesn't matter. Did you change the IP addr > > > > of the servers in the > > > > > >> nagios config? Are you use state retention? > > > >> > > > >> > > > >> -- > > > >> -sg > > > > > > -- > > ------------------------------------------------------- > > > > SF.Net email is Sponsored by the Better Software > > > > Conference & EXPO > > > > > September 19-22, 2005 * San Francisco, CA * > > > > Development Lifecycle Practices > > > > > Agile & Plan-Driven Development * Managing > > > > Projects & Teams * Testing & QA > > > > > Security * Process Improvement & Measurement * > > > > http://www.sqe.com/bsce5sf > > > > > _______________________________________________ > > > Nagios-users mailing list > > > Nagios-users at lists.sourceforge.net > > https://lists.sourceforge.net/lists/listinfo/nagios-users > > > > ::: Please include Nagios version, plugin version > > > > (-v) and OS when reporting > > > > > any issue. > > > > > > ::: Messages without supporting info will risk > > > > being sent to /dev/null > > __________________________________________________ > Do You Yahoo!? > Tired of spam? Yahoo! Mail has the best spam protection around > http://mail.yahoo.com > > > ------------------------------------------------------- > SF.Net email is Sponsored by the Better Software Conference & EXPO > September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices > Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA > Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > > ::: Please include Nagios version, plugin version (-v) and OS when > ::: reporting any issue. Messages without supporting info will risk being > ::: sent to /dev/null ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mark.ahlstrom at managedmail.com Fri Sep 9 17:12:52 2005 From: mark.ahlstrom at managedmail.com (Mark Ahlstrom) Date: Fri, 09 Sep 2005 10:12:52 -0500 Subject: Event handler question Message-ID: <1126278772.27208.15.camel@mediis> No, it didn't have any special characters. The output was 6 through 10 (the numbers are the array slices in @ARGV) 0 :: OK 1 :: HARD 2 :: 3 3 :: smt 4 :: DISKS 5 :: 1126206404 6 :: All 7 :: disks 8 :: below 9 :: warning/critical 10 :: thresholds One of the odd things with this is the event handler with "$OUTPUT$" defined in the command would only execute on the recovery, even though the log showed Nagios running the command. The other odd thing was finding out nagios would execute the command even in downtime. Mark > -----Original Message----- > From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users- > admin at lists.sourceforge.net] On Behalf Of Mark Ahlstrom > Sent: Thursday, September 08, 2005 3:31 PM > To: nagios-users at lists.sourceforge.net > Subject: [Nagios-users] RE: Event handler question >=20 > Wouldn't you guess that I would figure out why the event handler wasn't > executing just after I posted. >=20 > The handler will only execute upon a recovery when I have $OUTPUT$ > defined in the command. Once I yank that out of the command definition, > the handler executes as the documentation states. I don't use event handlers but that seems awfully strange. Does the output include special characters like ', ", &, etc that aren't being quoted properly when in a non-OK state? -- Marc ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From enst at rao.elektra.ru Fri Sep 9 17:17:52 2005 From: enst at rao.elektra.ru (Evgeny Stepanov) Date: Fri, 9 Sep 2005 19:17:52 +0400 Subject: services get lost sometimes Message-ID: <322413363.20050909191752@rao.elektra.ru> hello nagios-users at lists.sourceforge.net ! Does anybody know, why my services occasionally get lost sometimes? I load service detail page with nagios web interface and it says i have total of 31 services, but really i have 68. When i refresh the page a couple of times it shows me all my 68 services. after some period of time they get lost again and after refreshing the page they appear again. What's going on? Maybe there are any cache or something? Actually those services get lost from any page i load, summary, servicegroup, whatever. Any suggestions what to check? Thanks in advance. Evgeny ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From sxanrr at yahoo.com Fri Sep 9 17:19:20 2005 From: sxanrr at yahoo.com (Sxan) Date: Fri, 9 Sep 2005 08:19:20 -0700 (PDT) Subject: Host Down alert when Host is Up In-Reply-To: <200509091101.13405.jjesse@iserv.net> References: <200509091101.13405.jjesse@iserv.net> Message-ID: <20050909151920.85924.qmail@web32211.mail.mud.yahoo.com> Yes :) --- jjesse at iserv.net wrote: > I assume you reloaded Nagios after changing the > file? > > On Friday 09 September 2005 10:51, Sxan wrote: > > The IP addresses did change, but I accounted for > that > > in the hosts.cfg file for this host. I'm not an > expert > > with Nagios so I'm not sure totally what you mean > when > > you're asking if I use state retention. But > basically > > the situation is that everything worked fine > > yesterday, IP addresses were changed and the 2 > servers > > were put on a new vlan, but they are still up and > > completely reachable by ping and whatever else. > That's > > why I'm kind of confused. Here's the nagios > > notification if it helps at all... > > > > ***** Nagios ***** > > > > Notification Type: PROBLEM > > Host: boca-srv01 > > State: DOWN > > Address: 10.168.0.201 > > Info: CRITICAL - Plugin timed out after 10 seconds > > > > Date/Time: Fri Sept 9 10:47:18 EDT 2005 > > > > > > > > > > > > ~Jim > > > > --- Fred wrote: > > > Yes, you are correct, I misspoke (typed?), > however, > > > my point > > > remains that if the host-check returns a > critical, > > > nagios will > > > treat the node as down. Whatever the command > you > > > are using ... > > > > > > "down" is a state which is defined by the > > > host-check, it doesn't > > > always mean the host is down, maybe its just not > > > behaving as > > > expected. > > > > > > -FredC > > > > > > --- Subhendu Ghosh wrote: > > > > On Fri, 9 Sep 2005, Fred wrote: > > > > > On my configuration, I ping some smart > switches > > > > > > via a host-check. > > > > > > > > If the TTL is out of bounds according to the > > > > > > host-check command > > > > > > > > it returns a warning or critical. If it > returns > > > > > > a critical the > > > > > > > > switch is reported as "down". > > > > > > > > > > -FredC > > > > > > > > check_ping only support RTA and packet loss. > You > > > > > > must be using something > > > > > > > else.. > > > > > > > > > --- Subhendu Ghosh > wrote: > > > > >> On Fri, 9 Sep 2005, Sxan wrote: > > > > >>> Hey everyone, > > > > >>> > > > > >>> We just changed the vlan that 2 servers > are on > > > > > > in one > > > > > > > >>> location and all of a sudden Nagios is now > > > > > > reporting > > > > > > > >>> those servers as being down, even though > I'm > > > > > > able to > > > > > > > >>> ping them fine from the Nagios server > itself > > > > > > and > > > > > > > >>> everything else is functioning just fine. > I'm > > > > > > confused > > > > > > > >>> as to why this would happen. Any ideas? > I'm > > > > > > also > > > > > > > >>> wondering if this could be related to the > TTL > > > > > > possibly > > > > > > > >>> changing? If so, does anyone know any > simple > > > > > > way to > > > > > > > >>> change the ttl that Nagios looks for or > maybe > > > > > > increase > > > > > > > >>> it somehow on the box? Thanks as always! > > > > >>> > > > > >>> ~Jim > > > > >> > > > > >> TTL doesn't matter. Did you change the IP > addr > > > > > > of the servers in the > > > > > > > >> nagios config? Are you use state > retention? > > > > >> > > > > >> > > > > >> -- > > > > >> -sg > > > > > > > > -- > > > > > ------------------------------------------------------- > > > > > > SF.Net email is Sponsored by the Better > Software > > > > > > Conference & EXPO > > > > > > > September 19-22, 2005 * San Francisco, CA * > > > > > > Development Lifecycle Practices > > > > > > > Agile & Plan-Driven Development * Managing > > > > > > Projects & Teams * Testing & QA > > > > > > > Security * Process Improvement & Measurement * > > > > > > http://www.sqe.com/bsce5sf > > > > > > > > _______________________________________________ > > > > Nagios-users mailing list > > > > Nagios-users at lists.sourceforge.net > > > > > https://lists.sourceforge.net/lists/listinfo/nagios-users > > > > > > ::: Please include Nagios version, plugin > version > > > > > > (-v) and OS when reporting > > > > > > > any issue. > > > > > > > > ::: Messages without supporting info will risk > > > > > > being sent to /dev/null > > > > __________________________________________________ > > Do You Yahoo!? > > Tired of spam? Yahoo! Mail has the best spam > protection around > > http://mail.yahoo.com > > > > > > > ------------------------------------------------------- > > SF.Net email is Sponsored by the Better Software > Conference & EXPO > > September 19-22, 2005 * San Francisco, CA * > Development === message truncated === ______________________________________________________ Click here to donate to the Hurricane Katrina relief effort. http://store.yahoo.com/redcross-donate3/ ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From davidj at synaq.com Fri Sep 9 17:17:00 2005 From: davidj at synaq.com (David Jacobson) Date: Fri, 09 Sep 2005 17:17:00 +0200 Subject: services get lost sometimes In-Reply-To: <322413363.20050909191752@rao.elektra.ru> References: <322413363.20050909191752@rao.elektra.ru> Message-ID: <1126279020.6069.43.camel@jakes.synaq.com> Hi Evgeny, You have multiple instances of Nagios running, make sure you kill all nagios processes and start it up again. Regards, David On Fri, 2005-09-09 at 19:17 +0400, Evgeny Stepanov wrote: > hello nagios-users at lists.sourceforge.net ! > > Does anybody know, why my services occasionally get lost sometimes? I > load service detail page with nagios web interface and it says i have > total of 31 services, but really i have 68. When i refresh the page a > couple of times it shows me all my 68 services. after some period of > time they get lost again and after refreshing the page they appear > again. > > What's going on? Maybe there are any cache or something? Actually > those services get lost from any page i load, summary, servicegroup, > whatever. Any suggestions what to check? > > Thanks in advance. > Evgeny > > > > ------------------------------------------------------- > SF.Net email is Sponsored by the Better Software Conference & EXPO > September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices > Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA > Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null -- Regards, David Jacobson Technical Director SYNAQ (Pty) Ltd Tel: 011 245 5888 Direct: 011 245 5889 Fax: 011 783 9275 Cell: 083 235 0760 Mail: davidj at synaq.com Web: http://www.synaq.com Key Fingerprint 8246 FCE1 3C22 7EFB E61B 18DF 6E8B 65E8 BD50 78A1 -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 189 bytes Desc: This is a digitally signed message part URL: From andrew at profitability.net Fri Sep 9 17:20:34 2005 From: andrew at profitability.net (Andrew Cruse) Date: Fri, 9 Sep 2005 11:20:34 -0400 Subject: services get lost sometimes In-Reply-To: <322413363.20050909191752@rao.elektra.ru> References: <322413363.20050909191752@rao.elektra.ru> Message-ID: nagios-users-admin at lists.sourceforge.net wrote: > hello nagios-users at lists.sourceforge.net ! > > Does anybody know, why my services occasionally get lost > sometimes? I load service detail page with nagios web > interface and it says i have total of 31 services, but really > i have 68. When i refresh the page a couple of times it shows > me all my 68 services. after some period of time they get > lost again and after refreshing the page they appear again. > > What's going on? Maybe there are any cache or something? > Actually those services get lost from any page i load, > summary, servicegroup, whatever. Any suggestions what to check? You've got two instances of Nagios running simultaneously. Shut nagios down and kill any extra nagios processes that are left running and then start Nagios fresh. Andrew ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From enst at rao.elektra.ru Fri Sep 9 17:32:51 2005 From: enst at rao.elektra.ru (Evgeny Stepanov) Date: Fri, 9 Sep 2005 19:32:51 +0400 Subject: services get lost sometimes In-Reply-To: References: <322413363.20050909191752@rao.elektra.ru> Message-ID: <61341894.20050909193251@rao.elektra.ru> hello! AC> You've got two instances of Nagios running simultaneously. Shut nagios AC> down and kill any extra nagios processes that are left running and then AC> start Nagios fresh. AC> Andrew wow! this confused me a bit... i mean how come... the start script in my rc.d really starts two instances of nagios... will check why actually the more i'm knowing nagios the more i love it. it's really nice software! best regards Evgeny ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Pavel.Santos at opm.gov Fri Sep 9 17:37:08 2005 From: Pavel.Santos at opm.gov (Santos, Pavel) Date: Fri, 9 Sep 2005 11:37:08 -0400 Subject: testing Message-ID: This is a test ------------------------------- -- Even though this E-Mail has been scanned and found clean of -- known viruses, OPM can not guarantee this message is virus free. ------------------------------- -- This message was automatically generated. -------------------------------oo -------------- next part -------------- An HTML attachment was scrubbed... URL: From bobby.bradshaw at gmail.com Fri Sep 9 17:38:53 2005 From: bobby.bradshaw at gmail.com (Bobby Bradshaw) Date: Fri, 9 Sep 2005 11:38:53 -0400 Subject: nagios.cmd file Message-ID: <591f1911050909083829bf93c1@mail.gmail.com> Hi all! I cannot locate my nagios.cmd file in my Nagios server. My Nagios setup is: Suse 9.3 Professional Nagios v1.2 Nagios plugins v1.4 Nagios nrpe v2.0 I want to execute remote commands, i.e. schedule downtime, disable notifications, etc. But, I cannot locate my nagios.cmd file anywhere. Any assistance is greatly appreciated. -- Bobby Bradshaw, Jr. bobby.bradshaw at gmail.com ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From chris at aidworld.org Fri Sep 9 17:52:07 2005 From: chris at aidworld.org (Chris Wilson) Date: Fri, 09 Sep 2005 16:52:07 +0100 Subject: Setting env variables from misc commands (email from field) In-Reply-To: References: <1126195564.31899.38.camel@localhost> Message-ID: <1126276841.13343.327.camel@localhost> Hi Blake, > I know nullmailer doesn't ignore the USER variable as I can do it from > command line. > > Hence why I asked it as a nagios question, because it seems the misc > commands ignores multiple commands seperated by ; or it just doesn't > read from ENV variables. It probably does not support multiple commands nor setting environment variables directly. But you could try the following command: printf "%s" ... | env USER=alerts nullmailer... Alternatively, you could write a wrapper script for nullmailer that sets the address for you: #!/ bin/sh USER=alerts nullmailer ... > It's more of a Nagios question as you should be able to set the from > field in nagios. Why? Nagios doesn't know or care about emails or from fields, it just runs the command that you tell it to. Personally I like it that way. Cheers, Chris. -- (aidworld) chris wilson | chief engineer (chris at aidworld.org) ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Fri Sep 9 17:51:16 2005 From: marc at ena.com (Marc Powell) Date: Fri, 9 Sep 2005 10:51:16 -0500 Subject: nagios.cmd file Message-ID: > -----Original Message----- > From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users- > admin at lists.sourceforge.net] On Behalf Of Bobby Bradshaw > Sent: Friday, September 09, 2005 10:39 AM > To: nagios-users at lists.sourceforge.net > Subject: [Nagios-users] nagios.cmd file > > Hi all! > > I cannot locate my nagios.cmd file in my Nagios server. Look at the value of command_file in nagios.cfg. Of course, that's not enough to enable processing of external commands but that will tell you where the file will be located if external command processing is enabled. > > My Nagios setup is: > Suse 9.3 Professional > Nagios v1.2 > Nagios plugins v1.4 > Nagios nrpe v2.0 > > I want to execute remote commands, i.e. schedule downtime, disable > notifications, etc. But, I cannot locate my nagios.cmd file anywhere. > > Any assistance is greatly appreciated. Documentation is a good place to start -- http://nagios.sourceforge.net/docs/1_0/extcommands.html -- Marc ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ygonzales at medassets.com Fri Sep 9 17:57:52 2005 From: ygonzales at medassets.com (Gonzales, Youn) Date: Fri, 9 Sep 2005 10:57:52 -0500 Subject: Monitoring Nagios Services Message-ID: <99CF04974931C548B2BF20CD898FCBA7B75330@uscpgmedexch01.medassets.com> First, I would like to say how impressed we are with Nagios and PerfParse. We currently have them running on a 3 1/2 year old decommissioned laptop and we have been able to complete our testing. Does anyone have any suggestions for how to monitor the nagios services and restart them if necessary? Youn Gonzales Network Engineer MedAssets Supply Chain Systems 280 S Mount Auburn Rd Cape Girardeau, MO 63703 (573) 332-2285 Phone (573) 332-2300 Fax ygonzales at medassets.com "The information transmitted is intended only for the person or entity to which it is addressed and may contain confidential, proprietary, and/or privileged material. Any review, retransmission, dissemination or other use of, or taking of any action in reliance upon this information by persons or entities other than the intended recipient is prohibited. If you received this in error, please contact the sender and delete the material from all computers" ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From james.mohr at elaxy.com Fri Sep 9 17:56:37 2005 From: james.mohr at elaxy.com (Mohr James) Date: Fri, 9 Sep 2005 17:56:37 +0200 Subject: Notification of volatile acknowledged services Message-ID: Greetings! We are using Nagios as our central management software and it will be receiving messages from a couple of other software packages. Ever node is more or less monitored using both Nagios and (at least) SolarWinds. When SolarWinds detects a problem, such as a full hard disk, it sends an generates an alert which sends a message to the Nagios machine via send_nsca. This should then send out an SMS. Rather than having to create separate services for each node, we have created a generic service "Send SMS" and defined it as volatile. So each time the SolarWinds machine sends an alert, the Nagios machine should send an SMS. It seems that no SMS is sent if the state has been acknowledged. On the one had this makes sense. If the condition is being "worked on", then there is no need to send an SMS. However, we would like an SMS sent every time. The first question is whether or not I have interpreted it correctly that the acknowledgement disables the notification. I looked through the "Notifications" page in the Nagios doc, but did not find anything that mentions this. The second question is whether anyone has an idea how we can simply get Nagios to pump through the notifications, regardless of the state of the services. I had thought about creating something for the Event Broker, but that seems like a bit too extreme. Any help is greatly appreciated. Regards, Jim Mohr ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From bobby.bradshaw at gmail.com Fri Sep 9 17:58:59 2005 From: bobby.bradshaw at gmail.com (Bobby Bradshaw) Date: Fri, 9 Sep 2005 11:58:59 -0400 Subject: nagios.cmd file In-Reply-To: References: Message-ID: <591f1911050909085823579045@mail.gmail.com> I looked at the reference location in the nagios.cfg file already, which is /var/spool/nagios/nagios.cmd and it is not there. I also looked anywhere on my nagios server where Nagios is referenced and ....no nagios.cmd file. Nagios documentation says that that file is created on nagios startup, then deleted on shutdown. Well, my Nagios is running fine, but can't find nagios.cmd file. On 9/9/05, Marc Powell wrote: > > > > -----Original Message----- > > From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users- > > admin at lists.sourceforge.net] On Behalf Of Bobby Bradshaw > > Sent: Friday, September 09, 2005 10:39 AM > > To: nagios-users at lists.sourceforge.net > > Subject: [Nagios-users] nagios.cmd file > > > > Hi all! > > > > I cannot locate my nagios.cmd file in my Nagios server. > > Look at the value of command_file in nagios.cfg. Of course, that's not > enough to enable processing of external commands but that will tell you > where the file will be located if external command processing is > enabled. > > > > > My Nagios setup is: > > Suse 9.3 Professional > > Nagios v1.2 > > Nagios plugins v1.4 > > Nagios nrpe v2.0 > > > > I want to execute remote commands, i.e. schedule downtime, disable > > notifications, etc. But, I cannot locate my nagios.cmd file anywhere. > > > > Any assistance is greatly appreciated. > > Documentation is a good place to start -- > > http://nagios.sourceforge.net/docs/1_0/extcommands.html > > -- > Marc > > > ------------------------------------------------------- > SF.Net email is Sponsored by the Better Software Conference & EXPO > September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices > Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA > Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -- Bobby Bradshaw, Jr. bobby.bradshaw at gmail.com ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Fri Sep 9 18:12:43 2005 From: marc at ena.com (Marc Powell) Date: Fri, 9 Sep 2005 11:12:43 -0500 Subject: nagios.cmd file Message-ID: > -----Original Message----- > From: Bobby Bradshaw [mailto:bobby.bradshaw at gmail.com] > Sent: Friday, September 09, 2005 10:59 AM > To: Marc Powell > Cc: nagios-users at lists.sourceforge.net > Subject: Re: [Nagios-users] nagios.cmd file > > I looked at the reference location in the nagios.cfg file already, > which is /var/spool/nagios/nagios.cmd and it is not there. I also > looked anywhere on my nagios server where Nagios is referenced and > ....no nagios.cmd file. Can nagios write to that location? > > Nagios documentation says that that file is created on nagios startup, > then deleted on shutdown. Well, my Nagios is running fine, but can't > find nagios.cmd file. Did you verify the other required configuration options are enabled/correct as per the documentation reference I included? -- Marc > > > On 9/9/05, Marc Powell wrote: > > > > > > > -----Original Message----- > > > From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users- > > > admin at lists.sourceforge.net] On Behalf Of Bobby Bradshaw > > > Sent: Friday, September 09, 2005 10:39 AM > > > To: nagios-users at lists.sourceforge.net > > > Subject: [Nagios-users] nagios.cmd file > > > > > > Hi all! > > > > > > I cannot locate my nagios.cmd file in my Nagios server. > > > > Look at the value of command_file in nagios.cfg. Of course, that's not > > enough to enable processing of external commands but that will tell you > > where the file will be located if external command processing is > > enabled. > > > > > > > > My Nagios setup is: > > > Suse 9.3 Professional > > > Nagios v1.2 > > > Nagios plugins v1.4 > > > Nagios nrpe v2.0 > > > > > > I want to execute remote commands, i.e. schedule downtime, disable > > > notifications, etc. But, I cannot locate my nagios.cmd file anywhere. > > > > > > Any assistance is greatly appreciated. > > > > Documentation is a good place to start -- > > > > http://nagios.sourceforge.net/docs/1_0/extcommands.html ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Fri Sep 9 18:10:04 2005 From: marc at ena.com (Marc Powell) Date: Fri, 9 Sep 2005 11:10:04 -0500 Subject: Notification of volatile acknowledged services Message-ID: > -----Original Message----- > From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users- > admin at lists.sourceforge.net] On Behalf Of Mohr James > Sent: Friday, September 09, 2005 10:57 AM > To: nagios-users at lists.sourceforge.net > Subject: [Nagios-users] Notification of volatile acknowledged services > > Greetings! > > We are using Nagios as our central management software and it will be > receiving messages from a couple of other software packages. Ever node > is more or less monitored using both Nagios and (at least) SolarWinds. > When SolarWinds detects a problem, such as a full hard disk, it sends an > generates an alert which sends a message to the Nagios machine via > send_nsca. This should then send out an SMS. > > Rather than having to create separate services for each node, we have > created a generic service "Send SMS" and defined it as volatile. So each > time the SolarWinds machine sends an alert, the Nagios machine should > send an SMS. It seems that no SMS is sent if the state has been > acknowledged. On the one had this makes sense. If the condition is being > "worked on", then there is no need to send an SMS. However, we would > like an SMS sent every time. > > The first question is whether or not I have interpreted it correctly > that the acknowledgement disables the notification. I looked through the > "Notifications" page in the Nagios doc, but did not find anything that > mentions this. Yes, it's actually documented on the acknowledgment page itself -- "This command is used to acknowledge a service problem. When a service problem is acknowledged, future notifications about problems are temporarily disabled until the service changes state (i.e. recovers)." > > The second question is whether anyone has an idea how we can simply get > Nagios to pump through the notifications, regardless of the state of the > services. I had thought about creating something for the Event Broker, > but that seems like a bit too extreme. Don't use the acknowledge feature. Nagios will send alerts for every non-OK state with is_volatile enabled IFF you don't acknowledge it. I think if you can change your process around acknowledgments you'll get what you want. Alternately you could create an event handler to send the alerts if state != OK. -- Marc ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From JF.Leblond at SAQ.qc.ca Fri Sep 9 18:34:30 2005 From: JF.Leblond at SAQ.qc.ca (=?iso-8859-1?Q?Leblond=2C_Jean-Fran=E7ois?=) Date: Fri, 9 Sep 2005 12:34:30 -0400 Subject: Not able to get mail notification , Guidance requested Message-ID: Hi, I have a similar problem but with Nagios 2.0 under AIX 5.2 Mail are sent fine as well but Nagios doesn't send anything Contacts.cfg: define contact{ contact_name nagios-admin alias Nagios Admin service_notification_period 24x7 host_notification_period 24x7 service_notification_options w,u,c,r host_notification_options d,r service_notification_commands notify-by-email host_notification_commands host-notify-by-email email support.aix AT saq.qc.ca } Contactgroups.cfg: define contactgroup{ contactgroup_name admins alias Nagios Administrators members nagios-admin } Services.cfg define service{ use generic-service ; Name of service template to use # host_name * hostgroup_name aix_test_servers service_description http is_volatile 0 check_period 24x7 max_check_attempts 4 normal_check_interval 5 retry_check_interval 1 contact_groups admins notifications_enabled 1 notification_interval 960 notification_period 24x7 check_command check_http } The mail command in misccommands.cfg looks fine for this OS Thanks in advance for your help. Jean-Fran?ois Leblond jf.leblond at saq.qc.ca -----Message d'origine----- De?: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users-admin at lists.sourceforge.net] De la part de Marc Powell Envoy??: 7 septembre, 2005 11:12 ??: Nagios User Objet?: RE: [Nagios-users] Not able to get mail notification , Guidance requested > -----Original Message----- > From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users- > admin at lists.sourceforge.net] On Behalf Of John Joseph > Sent: Wednesday, September 07, 2005 12:36 AM > To: Nagios User > Subject: [Nagios-users] Not able to get mail notification , Guidance > requested > > Hi team > I have RHEL4 , nagios 1.2 , postfix , in which MTA is > working fine > I had made contact , contact group , on contact > details , I had given "host-notify-by-email and > notify-by-email " for host_notification_ command and > service_notification_command > and for that host > > > I have defined host " joseph" as > > Host "joseph" > Name : joseph > Alias : Joseph-TEST > Address : 192.168.20.99 > # Parents : Router-HO > # Host Groups : HO-Server > Check_command : > Max_check_attempts : 3 > Checks_enabled : Yes > Event_handler_enabled : Nothing > Event_handler : > Low_flap_threshold : 0 % > High_flap_threshold : 0 % > Flap_detection_enabled : Nothing > Process_perf_data : Nothing > Retain_status_information : Yes > Retain_nonstatus_information : Yes > Notification_interval : 3 * 60 sec > Notification_period : 24x7 > Notification_options : d,u,r > Notifications_enabled : Yes > Stalking_options : o,d,u > Status : Enabled > > For Services > I have defined FTP as > > Host name : joseph > Description : FTP Check > Is Volatile : Nothing > # Service Groups : ForServieGroup > Check_command : check_ftp > Check_command_arguments : > Max_check_attempts : 3 > Normal_check_interval : 3 * 60 sec > Retry_check_interval : 3 * 60 sec > Active_checks_enabled : Nothing > Passive_checks_enabled : Nothing > Check_period : 24x7 > Parallelize_check : Nothing > Obsess_over_service : Nothing > Check_freshness : Nothing > Freshness treshold : 0 sec > Event_handler : > Event_handler_arguments : > Event_handler enabled : Nothing > Low flap treshold : 0 % > High flap treshold : 0 % > Flap_detection_enabled : Nothing > Process_perf_data : Nothing > Retain_status_information : Nothing > Retain_nonstatus_information : Nothing > Notification_interval : 3 * 60 sec > Notification_period : 24x7 > Notification_options : w,u,c,r > Notification_enabled : Yes > # Contact Groups : IT-Support > Stalking_options : o,w,u,c > Status : Enabled > > > I am not able to get the mail notification , when > there is change for the ftp ie when FTP is stopped or > start The above information indicates that you don't have active or passive checks enabled for the service. Is that the case? If so, you're not checking the service so no notifications will ever go out. If you are checking the service and the information above is incorrect, check nagios.log for a notification attempt. Check your postfix logs. Verify that you can send a notification by issuing your notification commands exactly as they are defined as the nagios user (not root!) - post the test here. If you still have problems, please post the exact host and service definitions as well as your notification commands to this list - the information above is not them. Nagios.log entries around the time that the notification should happen would be useful as well. -- Marc ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From andrew at 2sheds.de Fri Sep 9 19:09:52 2005 From: andrew at 2sheds.de (Andrew Miehs) Date: Fri, 9 Sep 2005 19:09:52 +0200 Subject: Nagios_Grapher Message-ID: Dear list, I was looking for how I can contact the authors of Nagios Grapher... In the readme - they mentioned that they are reachable here. I have a problem with v1.3 that it doesn't seem to want to create the rrd files correctly - it does for ping - but load, etc aren't created due to the 'create' statement missing data.... 2005-09-09 19:03:12 RRD: rrdtool create /var/lib/nagios/rrd/localhost/ d9d37ec8e402b4c235e7b1b1e8214045.rrd --step= RRA:AVERAGE:0.5:5: RRA:MAX:0.5:5: RRA:MIN:0.5:5: RRA:AVERAGE:0.5:30: RRA:MAX:0.5:30: RRA:MIN:0.5:30:RRA:AVERAGE:0.5:120: RRA:MAX:0.5:120: RRA:MIN:0.5:120: RRA:AVERAGE:0.5:1440: RRA:MAX:0.5:1440: RRA:MIN:0.5:1440: 2005-09-09 19:03:12 RRD: [localhost][Current Users]:step size should be no less than one second 2005-09-09 17:34:07 RRD: rrdtool create /var/lib/nagios/rrd/localhost/ be0dcf79990574e9c88ba5325be4dcc6.rrd --step= RRA:AVERAGE:0.5:5: RRA:MAX:0.5:5: RRA:MIN:0.5:5: RRA:AVERAGE:0.5:30: RRA:MAX:0.5:30: RRA:MIN:0.5:30:RRA:AVERAGE:0.5:120: RRA:MAX:0.5:120: RRA:MIN:0.5:120: RRA:AVERAGE:0.5:1440: RRA:MAX:0.5:1440: RRA:MIN:0.5:1440: 2005-09-09 17:34:07 RRD: [localhost][Current Load]:step size should be no less than one second Ping however did work - Regards Andrew ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From sxanrr at yahoo.com Fri Sep 9 19:37:40 2005 From: sxanrr at yahoo.com (Sxan) Date: Fri, 9 Sep 2005 10:37:40 -0700 (PDT) Subject: Host Down alert when Host is Up - SOLVED In-Reply-To: <20050909140500.40012.qmail@web31907.mail.mud.yahoo.com> References: <20050909140500.40012.qmail@web31907.mail.mud.yahoo.com> Message-ID: <20050909173740.28990.qmail@web32203.mail.mud.yahoo.com> Ok, I figured out what was causing this to happen. It turned out that one of my custom check commands had the old IP address to use, so when it was not able to successfully run the check command, it was reporting the box as being down. I don't know why it would do that as opposed to just saying the application wasn't there, but I guess thats how Nagios works. If it goes to check a service and can't reach the host, a host down state is assumed. Thanks for the responses as always. ~Jim --- Fred wrote: > On my configuration, I ping some smart switches via > a host-check. > If the TTL is out of bounds according to the > host-check command > it returns a warning or critical. If it returns a > critical the > switch is reported as "down". > > -FredC > > --- Subhendu Ghosh wrote: > > > On Fri, 9 Sep 2005, Sxan wrote: > > > > > Hey everyone, > > > > > > We just changed the vlan that 2 servers are on > in one > > > location and all of a sudden Nagios is now > reporting > > > those servers as being down, even though I'm > able to > > > ping them fine from the Nagios server itself and > > > everything else is functioning just fine. I'm > confused > > > as to why this would happen. Any ideas? I'm also > > > wondering if this could be related to the TTL > possibly > > > changing? If so, does anyone know any simple way > to > > > change the ttl that Nagios looks for or maybe > increase > > > it somehow on the box? Thanks as always! > > > > > > ~Jim > > > > > > > TTL doesn't matter. Did you change the IP addr of > the servers in the > > nagios config? Are you use state retention? > > > > > > -- > > -sg > > > > > > > ------------------------------------------------------- > > SF.Net email is Sponsored by the Better Software > Conference & EXPO > > September 19-22, 2005 * San Francisco, CA * > Development Lifecycle Practices > > Agile & Plan-Driven Development * Managing > Projects & Teams * Testing & QA > > Security * Process Improvement & Measurement * > http://www.sqe.com/bsce5sf > > _______________________________________________ > > Nagios-users mailing list > > Nagios-users at lists.sourceforge.net > > > https://lists.sourceforge.net/lists/listinfo/nagios-users > > ::: Please include Nagios version, plugin version > (-v) and OS when reporting > > any issue. > > ::: Messages without supporting info will risk > being sent to /dev/null > > > > > > > > __________________________________________________ Do You Yahoo!? Tired of spam? Yahoo! Mail has the best spam protection around http://mail.yahoo.com ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From james.mohr at elaxy.com Fri Sep 9 19:40:46 2005 From: james.mohr at elaxy.com (Mohr James) Date: Fri, 9 Sep 2005 19:40:46 +0200 Subject: AW: Notification of volatile acknowledged services Message-ID: Hi Mark! Thanks for the help. I think you are right about changing our process. When the boss wants something *his* way, you gotta try to figure a way of doing it or prove to him that it can't be done. (Which ain't always easy). Regards, jimmo -----Urspr?ngliche Nachricht----- Von: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users-admin at lists.sourceforge.net] Im Auftrag von Marc Powell Gesendet: Freitag, 9. September 2005 18:10 An: nagios-users at lists.sourceforge.net Betreff: RE: [Nagios-users] Notification of volatile acknowledged services > -----Original Message----- > From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users- > admin at lists.sourceforge.net] On Behalf Of Mohr James > Sent: Friday, September 09, 2005 10:57 AM > To: nagios-users at lists.sourceforge.net > Subject: [Nagios-users] Notification of volatile acknowledged services > > Greetings! > > We are using Nagios as our central management software and it will be > receiving messages from a couple of other software packages. Ever node > is more or less monitored using both Nagios and (at least) SolarWinds. > When SolarWinds detects a problem, such as a full hard disk, it sends an > generates an alert which sends a message to the Nagios machine via > send_nsca. This should then send out an SMS. > > Rather than having to create separate services for each node, we have > created a generic service "Send SMS" and defined it as volatile. So each > time the SolarWinds machine sends an alert, the Nagios machine should > send an SMS. It seems that no SMS is sent if the state has been > acknowledged. On the one had this makes sense. If the condition is being > "worked on", then there is no need to send an SMS. However, we would > like an SMS sent every time. > > The first question is whether or not I have interpreted it correctly > that the acknowledgement disables the notification. I looked through the > "Notifications" page in the Nagios doc, but did not find anything that > mentions this. Yes, it's actually documented on the acknowledgment page itself -- "This command is used to acknowledge a service problem. When a service problem is acknowledged, future notifications about problems are temporarily disabled until the service changes state (i.e. recovers)." > > The second question is whether anyone has an idea how we can simply get > Nagios to pump through the notifications, regardless of the state of the > services. I had thought about creating something for the Event Broker, > but that seems like a bit too extreme. Don't use the acknowledge feature. Nagios will send alerts for every non-OK state with is_volatile enabled IFF you don't acknowledge it. I think if you can change your process around acknowledgments you'll get what you want. Alternately you could create an event handler to send the alerts if state != OK. -- Marc ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From dist-list at LEXUM.UMontreal.CA Fri Sep 9 19:53:18 2005 From: dist-list at LEXUM.UMontreal.CA (FM) Date: Fri, 09 Sep 2005 13:53:18 -0400 Subject: warning using check_dns Message-ID: <4321CC0E.6020305@lexum.umontreal.ca> Hello, Nagios 2.x I'm trying to use check_dns : [trieste plugins]# ./check_dns -H urbino.lan.lexum.pri -s dns1.lan.lexum.pri DNS OK: 0.239 seconds response time urbino.lan.lexum.pri returns 192.168.4.10|time=0.238538s;;;0.000000 [trieste plugins]# ./check_dns -H urbino.lan.lexum.pri -s dns1.lan.lexum.pri DNS WARNING - nslookup returned error status As you can see it worked at the first attempt but not at the second. How to you handle dns checking ? Thanks !!! ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From chroot at wells.vg Fri Sep 9 20:13:59 2005 From: chroot at wells.vg (Matt Wells) Date: Fri, 9 Sep 2005 11:13:59 -0700 Subject: NRPE SSL Error Message-ID: <174e4d9d1900ac9dadcd08a1e40263a8@open.wells.vg> I keep getting this error on all my servers. nrpe[23880]: Error: Could not complete SSL handshake. 5 Can anyone shed any light on this?? I've spent many hours on google on this but found nothing.? So here's the entire thing. My Server is running Nagios and Nagios only.? It is using the check_nrpe command to attach to other servers. The other servers are running nrpe client that I compiled once on the Nagios server, tar the nrpe, nrpe.cfg and the libexec and copied from server to server.? It is run as a dameon and is reporting data back to the nagios server.? So it appears on everything other than the /var/log/messages that all is well.? TCPDUMP shows the data as being encrypted.? I have permissions to all the nrpe files on the client side.? Am I missing something? -------------- next part -------------- An HTML attachment was scrubbed... URL: From corwin at aeternal.net Fri Sep 9 20:14:54 2005 From: corwin at aeternal.net (martin hudec) Date: Fri, 9 Sep 2005 20:14:54 +0200 Subject: NRPE SSL Error In-Reply-To: <174e4d9d1900ac9dadcd08a1e40263a8@open.wells.vg> References: <174e4d9d1900ac9dadcd08a1e40263a8@open.wells.vg> Message-ID: <20050909181454.GE27359@pleiades.aeternal.net> Hello, On Fri, Sep 09, 2005 at 11:13:59AM -0700 or thereabouts, Matt Wells wrote: > I keep getting this error on all my servers. > nrpe[23880]: Error: Could not complete SSL handshake. 5 > Can anyone shed any light on this?? > I've spent many hours on google on this but found nothing.? So > here's the entire thing. > My Server is running Nagios and Nagios only.? It is using the > check_nrpe command to attach to other servers. > The other servers are running nrpe client that I compiled once on the > Nagios server, tar the nrpe, nrpe.cfg and the libexec and copied from > server to server.? It is run as a dameon and is reporting data back > to the nagios server.? So it appears on everything other than the > /var/log/messages that all is well.? TCPDUMP shows the data as being > encrypted.? I have permissions to all the nrpe files on the client > side.? Am I missing something? What kind of OS are you using? What kind of Nagios nrpe are you using on both client (nagios server) side and on server (nrpe daemon on monitored server) side? I had same problem before, and it was due to nrpe v2 on monitored servers and its requirement to encrypt data with ssl. So I compiled nrpe v2 with ssl enabled on both sides and it is running. Cheers, Martin -- martin hudec * 421 907 303 393 * corwin at aeternal.net * http://www.aeternal.net "Nothing travels faster than the speed of light with the possible exception of bad news, which obeys its own special laws." Douglas Adams, "The Hitchhiker's Guide to the Galaxy" -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: application/pgp-signature Size: 187 bytes Desc: not available URL: From RLAdams at Kelsey-Seybold.com Fri Sep 9 20:34:50 2005 From: RLAdams at Kelsey-Seybold.com (Adams, Russell L.) Date: Fri, 9 Sep 2005 13:34:50 -0500 Subject: Monitoring Nagios Services In-Reply-To: <99CF04974931C548B2BF20CD898FCBA7B75330@uscpgmedexch01.medasset s.com> References: <99CF04974931C548B2BF20CD898FCBA7B75330@uscpgmedexch01.medassets .com> Message-ID: <20050909183450.GB32626@pingu.ksnet.com> Make nagios a child of init in inittab, set to respawn. Russell On Fri, Sep 09, 2005 at 10:57:52AM -0500, Gonzales, Youn wrote: > First, I would like to say how impressed we are with Nagios and > PerfParse. We currently have them running on a 3 1/2 year old > decommissioned laptop and we have been able to complete our testing. > > Does anyone have any suggestions for how to monitor the nagios services > and restart them if necessary? > > Youn Gonzales > Network Engineer > MedAssets Supply Chain Systems > 280 S Mount Auburn Rd > Cape Girardeau, MO 63703 > (573) 332-2285 Phone > (573) 332-2300 Fax > ygonzales at medassets.com > > "The information transmitted is intended only for the person or entity to which it is addressed and may contain confidential, proprietary, and/or privileged material. Any review, retransmission, dissemination or other use of, or taking of any action in reliance upon this information by persons or entities other than the intended recipient is prohibited. If you received this in error, please contact the sender and delete the material from all computers" > > > > > ------------------------------------------------------- > SF.Net email is Sponsored by the Better Software Conference & EXPO > September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices > Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA > Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From chroot at wells.vg Fri Sep 9 20:52:15 2005 From: chroot at wells.vg (Matt Wells) Date: Fri, 9 Sep 2005 11:52:15 -0700 Subject: NRPE SSL Error Message-ID: I am running Redhat.? I compiled the v2 on the server and copied the check_nrpe to the libexec dir.? This is from the same compile as the clients.? Other than the check_nrpe does the Nagios Server need anything else running that perhaps I've not run?? The clients have the /usr/local/nagios/nrpe -c /usr/local/nagios/nrpe.cfg -d for the client. The clients are in /usr/local/nagios and within this folder is nrpe nrpe.cfg libexec/ libexec/check_lots_of_checks_here ----- Original Message ----- SUBJECT:?Re: [Nagios-users] NRPE SSL Error FROM: ?martin hudec TO:?nagios-users at lists.sourceforge.net DATE:?09-09-2005 11:18 Hello, On Fri, Sep 09, 2005 at 11:13:59AM -0700 or thereabouts, Matt Wells wrote: > I keep getting this error on all my servers. > nrpe[23880]: Error: Could not complete SSL handshake. 5 > Can anyone shed any light on this?? > I've spent many hours on google on this but found nothing. So > here's the entire thing. > My Server is running Nagios and Nagios only. It is using the > check_nrpe command to attach to other servers. > The other servers are running nrpe client that I compiled once on the > Nagios server, tar the nrpe, nrpe.cfg and the libexec and copied from > server to server. It is run as a dameon and is reporting data back > to the nagios server. So it appears on everything other than the > /var/log/messages that all is well. TCPDUMP shows the data as being > encrypted. I have permissions to all the nrpe files on the client > side. Am I missing something? What kind of OS are you using? What kind of Nagios nrpe are you using on both client (nagios server) side and on server (nrpe daemon on monitored server) side? I had same problem before, and it was due to nrpe v2 on monitored servers and its requirement to encrypt data with ssl. So I compiled nrpe v2 with ssl enabled on both sides and it is running. Cheers, Martin -- martin hudec * 421 907 303 393 * corwin at aeternal.net * http://www.aeternal.net "Nothing travels faster than the speed of light with the possible exception of bad news, which obeys its own special laws." Douglas Adams, "The Hitchhiker's Guide to the Galaxy" -------------- next part -------------- An HTML attachment was scrubbed... URL: From corwin at aeternal.net Fri Sep 9 20:48:57 2005 From: corwin at aeternal.net (martin hudec) Date: Fri, 9 Sep 2005 20:48:57 +0200 Subject: NRPE SSL Error In-Reply-To: <5c877ba4d93c029fad57df3b0ef2d6db@open.wells.vg> References: <5c877ba4d93c029fad57df3b0ef2d6db@open.wells.vg> Message-ID: <20050909184857.GF27359@pleiades.aeternal.net> Hello, On Fri, Sep 09, 2005 at 11:33:26AM -0700 or thereabouts, Matt Wells wrote: > Martin thank you for such a fast reply.? I compiled the v2 on the > server and copied the check_nrpe to the libexec dir.? This is from > the same compile as the clients.? Other than the check_nrpe does the > Nagios Server need anything else running that perhaps I've not run?? > The clients have the /usr/local/nagios/nrpe -c > /usr/local/nagios/nrpe.cfg -d for the client. > The clients are in /usr/local/nagios and within this folder is > nrpe > nrpe.cfg > libexec/ > libexec/check_lots_of_checks_here Well I am actually interested in way how was nrpe compiled (you said that all nrpe are from the same compile), so look for compile options.. This was exactly what I was missing (to enable ssl during configuration before compilation).. I am using FreeBSD, so I used WITH_SSL=yes as compile option for nrpe port. So check your operating system and its package system for any configurable options for nrpe package. I am leaving from work, so I won't be online until morning. I hope my few lines will actually do some help. Cheers, Martin -- martin hudec * 421 907 303 393 * corwin at aeternal.net * http://www.aeternal.net "Nothing travels faster than the speed of light with the possible exception of bad news, which obeys its own special laws." Douglas Adams, "The Hitchhiker's Guide to the Galaxy" -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: application/pgp-signature Size: 187 bytes Desc: not available URL: From davea at support.kcm.org Fri Sep 9 21:00:24 2005 From: davea at support.kcm.org (Dave Augustus) Date: Fri, 09 Sep 2005 14:00:24 -0500 Subject: warning using check_dns In-Reply-To: <4321CC0E.6020305@lexum.umontreal.ca> References: <4321CC0E.6020305@lexum.umontreal.ca> Message-ID: <1126292424.26964.4.camel@kcm40202> I too have had problems with this plugin. I have posted to this list and got some replies but nothing ever really seem to fix the problem. I resorted to using the check_dns.pl script instead of check_dns. Dave On Fri, 2005-09-09 at 13:53 -0400, FM wrote: > Hello, > > Nagios 2.x > > I'm trying to use check_dns : > [trieste plugins]# ./check_dns -H urbino.lan.lexum.pri -s dns1.lan.lexum.pri > DNS OK: 0.239 seconds response time urbino.lan.lexum.pri returns > 192.168.4.10|time=0.238538s;;;0.000000 > > [trieste plugins]# ./check_dns -H urbino.lan.lexum.pri -s dns1.lan.lexum.pri > DNS WARNING - nslookup returned error status > > As you can see it worked at the first attempt but not at the second. > > How to you handle dns checking ? > > Thanks !!! > > > ------------------------------------------------------- > SF.Net email is Sponsored by the Better Software Conference & EXPO > September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices > Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA > Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null -- Dave Augustus ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From chroot at wells.vg Fri Sep 9 21:04:45 2005 From: chroot at wells.vg (Matt Wells) Date: Fri, 9 Sep 2005 12:04:45 -0700 Subject: NRPE SSL Error Message-ID: <3547eb108fe6e419ee7a21adcb6ba471@open.wells.vg> By default now it compiles with SSL enabled.. at least it said so in the configure file. ----- Original Message ----- SUBJECT:?Re: [Nagios-users] NRPE SSL Error FROM: ?martin hudec TO:?"Matt Wells" CC:?nagios-users at lists.sourceforge.net DATE:?09-09-2005 11:51 Hello, On Fri, Sep 09, 2005 at 11:33:26AM -0700 or thereabouts, Matt Wells wrote: > Martin thank you for such a fast reply. I compiled the v2 on the > server and copied the check_nrpe to the libexec dir. This is from > the same compile as the clients. Other than the check_nrpe does the > Nagios Server need anything else running that perhaps I've not run? > The clients have the /usr/local/nagios/nrpe -c > /usr/local/nagios/nrpe.cfg -d for the client. > The clients are in /usr/local/nagios and within this folder is > nrpe > nrpe.cfg > libexec/ > libexec/check_lots_of_checks_here Well I am actually interested in way how was nrpe compiled (you said that all nrpe are from the same compile), so look for compile options.. This was exactly what I was missing (to enable ssl during configuration before compilation).. I am using FreeBSD, so I used WITH_SSL=yes as compile option for nrpe port. So check your operating system and its package system for any configurable options for nrpe package. I am leaving from work, so I won't be online until morning. I hope my few lines will actually do some help. Cheers, Martin -- martin hudec * 421 907 303 393 * corwin at aeternal.net * http://www.aeternal.net "Nothing travels faster than the speed of light with the possible exception of bad news, which obeys its own special laws." Douglas Adams, "The Hitchhiker's Guide to the Galaxy" -------------- next part -------------- An HTML attachment was scrubbed... URL: From robmossrm at aol.com Fri Sep 9 21:23:06 2005 From: robmossrm at aol.com (Rob Moss) Date: Fri, 09 Sep 2005 20:23:06 +0100 Subject: NRPE SSL Error In-Reply-To: <3547eb108fe6e419ee7a21adcb6ba471@open.wells.vg> References: <3547eb108fe6e419ee7a21adcb6ba471@open.wells.vg> Message-ID: <4321E11A.7070900@aol.com> Matt Wells wrote: > By default now it compiles with SSL enabled.. at least it said so in > the configure file. > > ----- Original Message ----- > Subject: Re: [Nagios-users] NRPE SSL Error > From: martin hudec > To: "Matt Wells" > CC: nagios-users at lists.sourceforge.net > Date: 09-09-2005 11:51 > > > Hello, > > On Fri, Sep 09, 2005 at 11:33:26AM -0700 or thereabouts, Matt > Wells wrote: > > Martin thank you for such a fast reply. I compiled the v2 on the > > server and copied the check_nrpe to the libexec dir. This is from > > the same compile as the clients. Other than the check_nrpe does the > > Nagios Server need anything else running that perhaps I've not run? > > The clients have the /usr/local/nagios/nrpe -c > > /usr/local/nagios/nrpe.cfg -d for the client. > > The clients are in /usr/local/nagios and within this folder is > > nrpe > > nrpe.cfg > > libexec/ > > libexec/check_lots_of_checks_here > > Well I am actually interested in way how was nrpe compiled (you said > that all nrpe are from the same compile), so look for compile > options.. This was exactly what I was missing (to enable ssl during > configuration before compilation).. I am using FreeBSD, so I used > WITH_SSL=yes as compile option for nrpe port. So check your operating > system and its package system for any configurable options for nrpe > package. > > I am leaving from work, so I won't be online until morning. I hope my > few lines will actually do some help. > Most of the SSL errors I get are related to the allowed_hosts option. Check that your Nagios server IP address is on that line (separated by commas). Also, on your nagios server, run the check_nrpe command as the nagios user with exactly the same arguments as the Nagios server would have and display the error to us. Cheers. rob. -------------- next part -------------- An HTML attachment was scrubbed... URL: From chroot at wells.vg Fri Sep 9 21:45:25 2005 From: chroot at wells.vg (Matt Wells) Date: Fri, 9 Sep 2005 12:45:25 -0700 Subject: NRPE SSL Error Message-ID: <7a8040aff959844c06e26811a354b1a0@open.wells.vg> I am able to run the command and get a perfect reply.? I get the nrpe[25392]: Error: Could not complete SSL handshake. 5 every 3 minutes on the second. ----- Original Message ----- SUBJECT:?Re: [Nagios-users] NRPE SSL Error FROM: ?Rob Moss TO:?Undisclosed reciepent CC:?"nagios-users at lists.sourceforge.net" DATE:?09-09-2005 12:26 Matt Wells wrote: By default now it compiles with SSL enabled.. at least it said so in the configure file. ----- Original Message ----- SUBJECT: Re: [Nagios-users] NRPE SSL Error FROM: martin hudec [1] TO: "Matt Wells" [2] CC: nagios-users at lists.sourceforge.net[3] DATE: 09-09-2005 11:51 Hello, On Fri, Sep 09, 2005 at 11:33:26AM -0700 or thereabouts, Matt Wells wrote: > Martin thank you for such a fast reply. I compiled the v2 on the > server and copied the check_nrpe to the libexec dir. This is from > the same compile as the clients. Other than the check_nrpe does the > Nagios Server need anything else running that perhaps I've not run? > The clients have the /usr/local/nagios/nrpe -c > /usr/local/nagios/nrpe.cfg -d for the client. > The clients are in /usr/local/nagios and within this folder is > nrpe > nrpe.cfg > libexec/ > libexec/check_lots_of_checks_here Well I am actually interested in way how was nrpe compiled (you said that all nrpe are from the same compile), so look for compile options.. This was exactly what I was missing (to enable ssl during configuration before compilation).. I am using FreeBSD, so I used WITH_SSL=yes as compile option for nrpe port. So check your operating system and its package system for any configurable options for nrpe package. I am leaving from work, so I won't be online until morning. I hope my few lines will actually do some help. Most of the SSL errors I get are related to the allowed_hosts option. Check that your Nagios server IP address is on that line (separated by commas). Also, on your nagios server, run the check_nrpe command as the nagios user with exactly the same arguments as the Nagios server would have and display the error to us. Cheers. rob. Links: ------ [1] mailto:corwin at aeternal.net [2] mailto:chroot at wells.vg [3] mailto:nagios-users at lists.sourceforge.net -------------- next part -------------- An HTML attachment was scrubbed... URL: From magle at cacdhh.org Fri Sep 9 21:52:43 2005 From: magle at cacdhh.org (Matthew Agle) Date: Fri, 9 Sep 2005 15:52:43 -0400 Subject: suggestions for upgrading Nagios Message-ID: <001501c5b578$0aef8720$8400a8c0@sshi.local> Hello, I have currently running Nagios version 1.2 and looking to upgrade to 2.04b. Has anyone done this and/or performed a upgrade and if so what suggestions/tips would you have? Is it easier to upgrade or simply install to a different location and point the config files there? Thanks in advance for any feedback! Sincerely, Matthew Agle - MCP, MCSA Network Engineer CACDHH Information Technology Phone: (810)239-3112 ext. 233 Email: magle at cacdhh.org Disclaimer: The information transmitted is intended only for the person or entity to whom or which it is addressed and may contain confidential and/or privileged material. Any review, retransmission, dissemination or other use of this information by persons or entities other than the intended recipient is prohibited. If you receive this in error, please delete this material immediately. -------------- next part -------------- An HTML attachment was scrubbed... URL: From robmossrm at aol.com Fri Sep 9 22:06:35 2005 From: robmossrm at aol.com (Rob Moss) Date: Fri, 09 Sep 2005 21:06:35 +0100 Subject: NRPE SSL Error In-Reply-To: <7a8040aff959844c06e26811a354b1a0@open.wells.vg> References: <7a8040aff959844c06e26811a354b1a0@open.wells.vg> Message-ID: <4321EB4B.3060000@aol.com> What I meant was: On the nagios server, log in as the 'nagios' user, and run the command just as nagios might run it... for example: /usr/local/nagios/libexec/check_nrpe -H some.host.somewhere.com -c check_disk And see what happens. Maybe your SSL libraries aren't in your ld.so path or something, or you have some other problem executing the NRPE plugin which will show up if you run the command yourself. If that fails, then you may want to recompile NRPE and make sure you have the check_nrpe and the nrpe daemon copied onto both systems (you may have a mismatched copy, the client with SSL, the check program without). Cheers rob Matt Wells wrote: > I am able to run the command and get a perfect reply. > I get the nrpe[25392]: Error: Could not complete SSL handshake. 5 > every 3 minutes on the second. > > ----- Original Message ----- > Subject: Re: [Nagios-users] NRPE SSL Error > From: Rob Moss > To: Undisclosed reciepent > CC: "nagios-users at lists.sourceforge.net" > > Date: 09-09-2005 12:26 > > > Matt Wells wrote: > >> By default now it compiles with SSL enabled.. at least it said so >> in the configure file. >> >> ----- Original Message ----- >> Subject: Re: [Nagios-users] NRPE SSL Error >> From: martin hudec >> To: "Matt Wells" >> CC: nagios-users at lists.sourceforge.net >> Date: 09-09-2005 11:51 >> >> >> Hello, >> >> On Fri, Sep 09, 2005 at 11:33:26AM -0700 or thereabouts, Matt >> Wells wrote: >> > Martin thank you for such a fast reply. I compiled the v2 >> on the >> > server and copied the check_nrpe to the libexec dir. This >> is from >> > the same compile as the clients. Other than the check_nrpe >> does the >> > Nagios Server need anything else running that perhaps I've >> not run? >> > The clients have the /usr/local/nagios/nrpe -c >> > /usr/local/nagios/nrpe.cfg -d for the client. >> > The clients are in /usr/local/nagios and within this folder is >> > nrpe >> > nrpe.cfg >> > libexec/ >> > libexec/check_lots_of_checks_here >> >> Well I am actually interested in way how was nrpe compiled >> (you said >> that all nrpe are from the same compile), so look for compile >> options.. This was exactly what I was missing (to enable ssl >> during >> configuration before compilation).. I am using FreeBSD, so I used >> WITH_SSL=yes as compile option for nrpe port. So check your >> operating >> system and its package system for any configurable options >> for nrpe >> package. >> >> I am leaving from work, so I won't be online until morning. I >> hope my >> few lines will actually do some help. >> > Most of the SSL errors I get are related to the > > allowed_hosts > > option. Check that your Nagios server IP address is on that line > (separated by commas). > > Also, on your nagios server, run the check_nrpe command as the > nagios user with exactly the same arguments as the Nagios server > would have and display the error to us. > > Cheers. > rob. > -- Rob Moss Unix Systems Admin Hosting & DB Operations Hammersmith, London, UK Phone: +44 20 7348 8629 -------------- next part -------------- An HTML attachment was scrubbed... URL: From marc at ena.com Fri Sep 9 22:25:20 2005 From: marc at ena.com (Marc Powell) Date: Fri, 9 Sep 2005 15:25:20 -0500 Subject: warning using check_dns Message-ID: > -----Original Message----- > From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users- > admin at lists.sourceforge.net] On Behalf Of FM > Sent: Friday, September 09, 2005 12:53 PM > To: Mailing List Nagios > Subject: [Nagios-users] warning using check_dns > > Hello, > > Nagios 2.x > > I'm trying to use check_dns : > [trieste plugins]# ./check_dns -H urbino.lan.lexum.pri -s > dns1.lan.lexum.pri > DNS OK: 0.239 seconds response time urbino.lan.lexum.pri returns > 192.168.4.10|time=0.238538s;;;0.000000 > > [trieste plugins]# ./check_dns -H urbino.lan.lexum.pri -s > dns1.lan.lexum.pri > DNS WARNING - nslookup returned error status > > As you can see it worked at the first attempt but not at the second. check_dns is a fancy wrapper for nslookup. When it was called, nslookup returned an error and/or no output. Do you know why that might be? Can you successfully run '/path/to/nslookup -sil urbino.lan.lexum.pri dns1.lan.lexum.pri' multiple times? If it complains about the -sil, simply remove those flags and try again. If you're running a recent version of the plugins you can pass a -v flag to check_dns for verbose output. Finally, you should always do tests of this nature as the nagios user. While not an issue in the specific case, plugin execution problems can often be related to permissions for the nagios user that wouldn't be apparent when testing as root. -- Marc ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From dmourati at cm.math.uiuc.edu Sat Sep 10 00:21:36 2005 From: dmourati at cm.math.uiuc.edu (Demetri Mouratis) Date: Fri, 9 Sep 2005 17:21:36 -0500 (CDT) Subject: CGI Whoops Message on RHEL Message-ID: Hi, I'm in the process of doing a platform change from a generic Linux to Red Hat Enterprise Linux 3 ES. On the new platform, I can't get the CGIs to work at all. I get the: Whoops! Error: Could not read host and service status information! whenever I try to invoke a cgi from the web browser. The nagios process is running: [root at ops-db1 root]# ps -aef f | grep -i nagios|grep -v grep nagios 6524 1 0 15:08 ? S 0:00 /usr/bin/nagios -d /etc/nagios/nagios.cfg I'm using nagios 1.2 from dag: [root at ops-db1 cgi]# rpm -q nagios nagios-1.2-0.rhel3.dag The users are set up correctly: [root at ops-db1 root]# id apache uid=48(apache) gid=48(apache) groups=48(apache),903(nagios) [root at ops-db1 root]# id nagios uid=903(nagios) gid=903(nagios) groups=903(nagios) [root at ops-db1 nagios]# ls -lah total 104K drwx------ 6 nagios nagios 4.0K Sep 9 15:16 . drwxr-xr-x 8 root root 4.0K Sep 8 12:33 .. drwxr-xr-x 2 nagios nagios 4.0K Feb 11 2004 archives -rw------- 1 nagios nagios 148 Sep 9 15:13 .bash_history -rw-r--r-- 1 nagios nagios 24 Sep 8 00:56 .bash_logout -rw-r--r-- 1 nagios nagios 191 Sep 8 00:56 .bash_profile -rw-r--r-- 1 nagios nagios 124 Sep 8 00:56 .bashrc -rw-rw-r-- 1 nagios nagios 0 Sep 9 15:08 comment.log -rw-rw-r-- 1 nagios nagios 0 Sep 9 15:08 downtime.log drwx------ 2 nagios nagios 4.0K Sep 9 15:17 .gnupg -rw-rw-r-- 1 nagios nagios 44K Sep 9 15:14 nagios.log drwxr-sr-x 2 nagios apache 4.0K Sep 9 15:08 rw drwx------ 2 nagios nagios 4.0K Sep 9 12:18 .ssh -rw-rw-r-- 1 nagios nagios 7.9K Sep 9 15:16 status.log -rw-rw-r-- 1 nagios nagios 5.9K Sep 9 15:08 status.sav I started thinking the problem may have something to do with exec shield which Red Hat added in Update 3 so I disabled it: [root at ops-db1 nagios]# echo 0 > /proc/sys/kernel/exec-shield [root at ops-db1 nagios]# echo 0 > /proc/sys/kernel/exec-shield-randomize I can get the CGIs to work just fine from the command line: [root at ops-db1 cgi]# ./avail.cgi Cache-Control: no-store Pragma: no-cache Last-Modified: Fri, 09 Sep 2005 22:18:54 GMT Expires: Thu, 01 Jan 1970 00:00:00 GMT Content-type: text/html Nagios Availability This is driving me crazy at the moment so I'm hoping I've overlooked something obvious. Any help is greatly appreciated as I'm getting sick of wrestling with this one. Thanks. --------------------------------------------------------------------- Demetri Mouratis dmourati at linfactory.com ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From dmourati at cm.math.uiuc.edu Sat Sep 10 01:09:11 2005 From: dmourati at cm.math.uiuc.edu (Demetri Mouratis) Date: Fri, 9 Sep 2005 18:09:11 -0500 (CDT) Subject: CGI Whoops Message on RHEL In-Reply-To: References: Message-ID: On Fri, 9 Sep 2005, Demetri Mouratis wrote: > Hi, > > I'm in the process of doing a platform change from a generic Linux to Red Hat > Enterprise Linux 3 ES. On the new platform, I can't get the CGIs to work at > all. I get the: > > Whoops! > > Error: Could not read host and service status information! > > whenever I try to invoke a cgi from the web browser. The nagios process is > running: > > [root at ops-db1 root]# ps -aef f | grep -i nagios|grep -v grep > nagios 6524 1 0 15:08 ? S 0:00 /usr/bin/nagios -d > /etc/nagios/nagios.cfg > > I'm using nagios 1.2 from dag: > > [root at ops-db1 cgi]# rpm -q nagios > nagios-1.2-0.rhel3.dag > > The users are set up correctly: > > [root at ops-db1 root]# id apache > uid=48(apache) gid=48(apache) groups=48(apache),903(nagios) > [root at ops-db1 root]# id nagios > uid=903(nagios) gid=903(nagios) groups=903(nagios) > > [root at ops-db1 nagios]# ls -lah > total 104K > drwx------ 6 nagios nagios 4.0K Sep 9 15:16 . > drwxr-xr-x 8 root root 4.0K Sep 8 12:33 .. > drwxr-xr-x 2 nagios nagios 4.0K Feb 11 2004 archives > -rw------- 1 nagios nagios 148 Sep 9 15:13 .bash_history > -rw-r--r-- 1 nagios nagios 24 Sep 8 00:56 .bash_logout > -rw-r--r-- 1 nagios nagios 191 Sep 8 00:56 .bash_profile > -rw-r--r-- 1 nagios nagios 124 Sep 8 00:56 .bashrc > -rw-rw-r-- 1 nagios nagios 0 Sep 9 15:08 comment.log > -rw-rw-r-- 1 nagios nagios 0 Sep 9 15:08 downtime.log > drwx------ 2 nagios nagios 4.0K Sep 9 15:17 .gnupg > -rw-rw-r-- 1 nagios nagios 44K Sep 9 15:14 nagios.log > drwxr-sr-x 2 nagios apache 4.0K Sep 9 15:08 rw > drwx------ 2 nagios nagios 4.0K Sep 9 12:18 .ssh > -rw-rw-r-- 1 nagios nagios 7.9K Sep 9 15:16 status.log > -rw-rw-r-- 1 nagios nagios 5.9K Sep 9 15:08 status.sav > > I started thinking the problem may have something to do with exec shield > which Red Hat added in Update 3 so I disabled it: > > [root at ops-db1 nagios]# echo 0 > /proc/sys/kernel/exec-shield > [root at ops-db1 nagios]# echo 0 > /proc/sys/kernel/exec-shield-randomize > > I can get the CGIs to work just fine from the command line: > > [root at ops-db1 cgi]# ./avail.cgi > Cache-Control: no-store > Pragma: no-cache > Last-Modified: Fri, 09 Sep 2005 22:18:54 GMT > Expires: Thu, 01 Jan 1970 00:00:00 GMT > Content-type: text/html > > > > > Nagios Availability > > > > > > > > This is driving me crazy at the moment so I'm hoping I've overlooked > something obvious. Any help is greatly appreciated as I'm getting sick of > wrestling with this one. > > Thanks. > Sorry to reply to my own post but this is now fixed. It turned out that a another developer had modified the permissions on /var/log/nagios making it impossible for the CGI's to access the status and log files. For the record, 750 perms worked just fine in my config. Thanks. --------------------------------------------------------------------- Demetri Mouratis dmourati at linfactory.com ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From sghosh at sghosh.org Sat Sep 10 01:21:55 2005 From: sghosh at sghosh.org (Subhendu Ghosh) Date: Fri, 9 Sep 2005 19:21:55 -0400 (EDT) Subject: warning using check_dns In-Reply-To: <1126292424.26964.4.camel@kcm40202> References: <4321CC0E.6020305@lexum.umontreal.ca> <1126292424.26964.4.camel@kcm40202> Message-ID: A discussion this last week on the plugins list pointed to a new behaviour in 2.6.11 kernels in RHEL based systems for fork() that is causing these problems. Reports like the one below are hard to diagnose if one does not provide context like plugin version, os version. These things are requested in every message footer, yet seem to be lacking in requests for help. -sg On Fri, 9 Sep 2005, Dave Augustus wrote: > I too have had problems with this plugin. I have posted to this list and > got some replies but nothing ever really seem to fix the problem. > > I resorted to using the check_dns.pl script instead of check_dns. > > Dave > > > > On Fri, 2005-09-09 at 13:53 -0400, FM wrote: >> Hello, >> >> Nagios 2.x >> >> I'm trying to use check_dns : >> [trieste plugins]# ./check_dns -H urbino.lan.lexum.pri -s dns1.lan.lexum.pri >> DNS OK: 0.239 seconds response time urbino.lan.lexum.pri returns >> 192.168.4.10|time=0.238538s;;;0.000000 >> >> [trieste plugins]# ./check_dns -H urbino.lan.lexum.pri -s dns1.lan.lexum.pri >> DNS WARNING - nslookup returned error status >> >> As you can see it worked at the first attempt but not at the second. >> >> How to you handle dns checking ? >> >> Thanks !!! >> >> >> ------------------------------------------------------- >> SF.Net email is Sponsored by the Better Software Conference & EXPO >> September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices >> Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA >> Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf >> _______________________________________________ >> Nagios-users mailing list >> Nagios-users at lists.sourceforge.net >> https://lists.sourceforge.net/lists/listinfo/nagios-users >> ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. >> ::: Messages without supporting info will risk being sent to /dev/null > -- ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jhmartin at toger.us Sat Sep 10 03:46:23 2005 From: jhmartin at toger.us (Jason Martin) Date: Fri, 9 Sep 2005 18:46:23 -0700 Subject: Monitoring Nagios Services In-Reply-To: <99CF04974931C548B2BF20CD898FCBA7B75330@uscpgmedexch01.medassets.com> References: <99CF04974931C548B2BF20CD898FCBA7B75330@uscpgmedexch01.medassets.com> Message-ID: <20050910014622.GC23609@zippy.toger.us> On Fri, Sep 09, 2005 at 10:57:52AM -0500, Gonzales, Youn wrote: > Does anyone have any suggestions for how to monitor the nagios services > and restart them if necessary? Put nagios in non-fork mode and put it into inittab. It'll get restarted automtically. -Jason Martin -- This message is PGP/MIME signed. -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: application/pgp-signature Size: 211 bytes Desc: not available URL: From niceforums at yahoo.com Sat Sep 10 10:20:27 2005 From: niceforums at yahoo.com (hamideh daliri) Date: Sat, 10 Sep 2005 01:20:27 -0700 (PDT) Subject: nagios web interface work with enabled SELinux Message-ID: <20050910082027.78475.qmail@web30106.mail.mud.yahoo.com> hi, when i install nagis on RHEL4 i have some problems with nagios web interface and it was an internal error by apache that forends said me it is because of SELinux that is active on RHEL4 and for using web interface of nagios it sould be disabled . but now it is ok althought SELinux is enabled on my box,i have defined new type for nagios and replace the security context of all files and diectories in nagios dir with it and write some rules in apache.te . if anyone is eager in this issue tell me to describe more and put codes , tnx . ______________________________________________________ Click here to donate to the Hurricane Katrina relief effort. http://store.yahoo.com/redcross-donate3/ ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From davea at support.kcm.org Sat Sep 10 23:44:35 2005 From: davea at support.kcm.org (Dave Augustus) Date: Sat, 10 Sep 2005 16:44:35 -0500 Subject: warning using check_dns In-Reply-To: References: <4321CC0E.6020305@lexum.umontreal.ca> <1126292424.26964.4.camel@kcm40202> Message-ID: <1126388675.4488.3.camel@springer> My apologies for not supplying the information you are referring to. The footers of so many email contain useless information that I have grown accustom to ignoring them. :( Dave On Fri, 2005-09-09 at 19:21 -0400, Subhendu Ghosh wrote: > A discussion this last week on the plugins list pointed to a new behaviour > in 2.6.11 kernels in RHEL based systems for fork() that is causing these > problems. > > Reports like the one below are hard to diagnose if one does not provide > context like plugin version, os version. These things are requested in > every message footer, yet seem to be lacking in requests for help. > > -sg > > On Fri, 9 Sep 2005, Dave Augustus wrote: > > > I too have had problems with this plugin. I have posted to this list and > > got some replies but nothing ever really seem to fix the problem. > > > > I resorted to using the check_dns.pl script instead of check_dns. > > > > Dave > > > > > > > > On Fri, 2005-09-09 at 13:53 -0400, FM wrote: > >> Hello, > >> > >> Nagios 2.x > >> > >> I'm trying to use check_dns : > >> [trieste plugins]# ./check_dns -H urbino.lan.lexum.pri -s dns1.lan.lexum.pri > >> DNS OK: 0.239 seconds response time urbino.lan.lexum.pri returns > >> 192.168.4.10|time=0.238538s;;;0.000000 > >> > >> [trieste plugins]# ./check_dns -H urbino.lan.lexum.pri -s dns1.lan.lexum.pri > >> DNS WARNING - nslookup returned error status > >> > >> As you can see it worked at the first attempt but not at the second. > >> > >> How to you handle dns checking ? > >> > >> Thanks !!! > >> > >> > >> ------------------------------------------------------- > >> SF.Net email is Sponsored by the Better Software Conference & EXPO > >> September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices > >> Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA > >> Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf > >> _______________________________________________ > >> Nagios-users mailing list > >> Nagios-users at lists.sourceforge.net > >> https://lists.sourceforge.net/lists/listinfo/nagios-users > >> ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > >> ::: Messages without supporting info will risk being sent to /dev/null > > ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From rossz at vamos-wentworth.org Sun Sep 11 07:32:25 2005 From: rossz at vamos-wentworth.org (Rossz Vamos-Wentworth) Date: Sat, 10 Sep 2005 22:32:25 -0700 Subject: Passive tests and notifications Message-ID: <4323C169.4040405@vamos-wentworth.org> Yesterday I configured a couple of passive tests on one of my servers, disk space and the mysql process, to be specific. Later that night I logged on to the status screen to check things and saw that the passive tests were marked "OK", but they hadn't been updated for several hours even though the cronjob I set up was every 30 minutes. I obviously made some kind of mistake that I will deal with on monday (can't do anything right now on the remote server because I recently changed the root password and can't remember what it is, :) so have to wait until I'm in the office and can look it up). What bothers me is I never received a notification. How do I get nagios to send out a notification or fall back to an active test if the information is stale? -- Rossz ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jjk_saji at yahoo.com Sun Sep 11 11:27:22 2005 From: jjk_saji at yahoo.com (John Joseph) Date: Sun, 11 Sep 2005 10:27:22 +0100 (BST) Subject: Not able to get mail notification , Guidance requested In-Reply-To: References: Message-ID: <20050911092722.11648.qmail@web40829.mail.yahoo.com> --- Marc Powell wrote: > > > The above information indicates that you don't have > active or passive > checks enabled for the service. Is that the case? If > so, you're not > checking the service so no notifications will ever > go out. If you are > checking the service and the information above is > incorrect, check > nagios.log for a notification attempt. Check your > postfix logs. Verify > that you can send a notification by issuing your > notification commands > exactly as they are defined as the nagios user (not > root!) - post the > test here. If you still have problems, please post > the exact host and > service definitions as well as your notification > commands to this list - > the information above is not them. Nagios.log > entries around the time > that the notification should happen would be useful > as well. > > -- > Marc > > Hi Marc Thanks for the mail , I am adding the info a> ?active/passive? Now I had enabled active / passive checks for all the services for the host ?joseph? and checked it again , but , I am not getting the notification I checked nagios.log for notification attempt , I did not find any mention of notification in the nagios.log ********************************************** b > ?postfix ? My postfix MTA is working fine , and I am able to send the mail from command mode , I am able to execute the notification command from the users prompt such as c> ?notification email command? /usr/bin/printf "%b" "***** Nagios 1.X*****Notification\nType:$NOTIFICATIONTYPE$\n Host: $HOSTNAME$\nState: $HOSTSTATE$Address: $HOSTADDRESS$\nInfo: $OUTPUT$\nDate/Time: $DATETIME$" | /bin/mail -s "Host $HOSTSTATE$ alert for $HOSTNAME$" joseph at test.com *********************************************************** NOT ABLE TO SEND when I specify $CONTACTEMAIL$ from nagios prompt >From nagios users prompt , if I give exactly as in the notification command I am not getting the mail /usr/bin/printf "%b" "***** Nagios 1.X*****Notification\nType:$NOTIFICATIONTYPE$\n Host: $HOSTNAME$\nState: $HOSTSTATE$Address: $HOSTADDRESS$\nInfo: $OUTPUT$\nDate/Time: $DATETIME$" | /bin/mail -s "Host $HOSTSTATE$ alert for $HOSTNAME$" $CONTACTEMAIL$ d>mail log when I send the above command from the nagios command mode is Sep 11 12:55:54 john postfix/pickup[2509]: 749A8474130: uid=511 from= Sep 11 12:55:54 john postfix/cleanup[5950]: 749A8474130: message-id=<20050911085554.749A8474130 at john.oreon.ae> Sep 11 12:55:54 john postfix/qmgr[2510]: 749A8474130: from=, size=452, nrcpt=1 (queue active) Sep 11 12:55:54 john postfix/local[5951]: 749A8474130: to=<$@john.oreon.ae>, orig_to=<$>, relay=local, delay=0, status=bounced (unknown user: "$") Sep 11 12:55:54 john postfix/cleanup[5950]: 8862847412E: message-id=<20050911085554.8862847412E at john.oreon.ae> Sep 11 12:55:54 john postfix/qmgr[2510]: 8862847412E: from=<>, size=2101, nrcpt=1 (queue active) Sep 11 12:55:54 john postfix/qmgr[2510]: 749A8474130: removed Sep 11 12:55:54 john postfix/local[5951]: 8862847412E: to=, relay=local, delay=0, status=sent (delivered to mailbox) Sep 11 12:55:54 john postfix/qmgr[2510]: 8862847412E: removed Sep 11 12:55:56 john postfix/pickup[2509]: 6AF85474130: uid=511 from= Sep 11 12:55:56 john postfix/cleanup[5950]: 6AF85474130: message-id=<20050911085556.6AF85474130 at john.oreon.ae> Sep 11 12:55:56 john postfix/qmgr[2510]: 6AF85474130: from=, size=452, nrcpt=1 (queue active) Sep 11 12:55:56 john postfix/local[5951]: 6AF85474130: to=<$@john.oreon.ae>, orig_to=<$>, relay=local, delay=0, status=bounced (unknown user: "$") Sep 11 12:55:56 john postfix/cleanup[5950]: 7557A47412E: message-id=<20050911085556.7557A47412E at john.oreon.ae> Sep 11 12:55:56 john postfix/qmgr[2510]: 7557A47412E: from=<>, size=2101, nrcpt=1 (queue active) Sep 11 12:55:56 john postfix/qmgr[2510]: 6AF85474130: removed Sep 11 12:55:56 john postfix/local[5951]: 7557A47412E: to=, relay=local, delay=0, status=sent (delivered to mailbox) Sep 11 12:55:56 john postfix/qmgr[2510]: 7557A47412E: removed Sep 11 12:55:57 john postfix/pickup[2509]: 2740A474130: uid=511 from= Sep 11 12:55:57 john postfix/cleanup[5950]: 2740A474130: message-id=<20050911085557.2740A474130 at john.oreon.ae> Sep 11 12:55:57 john postfix/qmgr[2510]: 2740A474130: from=, size=452, nrcpt=1 (queue active) Sep 11 12:55:57 john postfix/local[5951]: 2740A474130: to=<$@john.oreon.ae>, orig_to=<$>, relay=local, delay=0, status=bounced (unknown user: "$") Sep 11 12:55:57 john postfix/cleanup[5950]: 29ED8474094: message-id=<20050911085557.29ED8474094 at john.oreon.ae> Sep 11 12:55:57 john postfix/qmgr[2510]: 29ED8474094: from=<>, size=2101, nrcpt=1 (queue active) Sep 11 12:55:57 john postfix/qmgr[2510]: 2740A474130: removed Sep 11 12:55:57 john postfix/local[5951]: 29ED8474094: to=, relay=local, delay=0, status=sent (delivered to mailbox) Sep 11 12:55:57 john postfix/qmgr[2510]: 29ED8474094: removed ********************************************** I am sending the hosts.cfg , services.cfg , misccommands.cfg as attachment , Kindly guide me Thanks Joseph John ___________________________________________________________ How much free photo storage do you get? Store your holiday snaps for FREE with Yahoo! Photos http://uk.photos.yahoo.com -------------- next part -------------- A non-text attachment was scrubbed... Name: hosts.cfg Type: application/octet-stream Size: 15836 bytes Desc: 1819780330-hosts.cfg URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: services.cfg Type: application/octet-stream Size: 34156 bytes Desc: 4051563864-services.cfg URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: misccommands.cfg Type: application/octet-stream Size: 3249 bytes Desc: 3418209448-misccommands.cfg URL: From f1216 at yahoo.com Sun Sep 11 16:11:44 2005 From: f1216 at yahoo.com (Fred) Date: Sun, 11 Sep 2005 07:11:44 -0700 (PDT) Subject: oscp command design and FIFO locking? In-Reply-To: <20050911092722.11648.qmail@web40829.mail.yahoo.com> References: <20050911092722.11648.qmail@web40829.mail.yahoo.com> Message-ID: <20050911141144.46193.qmail@web31911.mail.mud.yahoo.com> Does anyone have an idea why the oscp command (for distributed monitoring) would kick off more then one command at a time? For example, if there are a number of checks that are completed, nagios kicks off multiple oscp scripts (submit commands). This causes the design of the submit command to need to throttle the access to whatever resources it might need to touch. If using the default send_nsca command, there can now be multiple (and many multiple) send_nsca's kicked off and each of these on the target server will all be attempting to write to the nagios FIFO. The nagios FIFO can get horribly overloaded. If the nagios master demon is not aggresively reading the FIFO (check_command_interval=-1) then the demons can stack up and eventually consume socket resources and memory etc. As far as I can tell, nsca doesn't lock the FIFO, which also means that writes will get intermixed with writes from plug-ins that might be running on the master system. (I have seen this over and over) To avoid this, I have had to implement serious locking in all plug-ins and not use nsca as it has no locking mechanism (that I know of). Right now I am fighting with the oscp commands that can launch dozens of copies at a time and each of these (in my case) write to a local file that will eventually be pushed up to the master and written (while locking) the nagios FIFO. So ... I guess my questions are: 1) Should nagios be forking off more then one oscp command at a time? 2) Has anyone else run into FIFO corruption because of the lack of advisory locking in all the plug-ins? Thanks in advance for any thoughts or observations here. -FredC ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Sun Sep 11 16:35:35 2005 From: marc at ena.com (Marc Powell) Date: Sun, 11 Sep 2005 09:35:35 -0500 Subject: Passive tests and notifications Message-ID: > -----Original Message----- > From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users- > admin at lists.sourceforge.net] On Behalf Of Rossz Vamos-Wentworth > Sent: Sunday, September 11, 2005 12:32 AM > To: nagios-users at lists.sourceforge.net > Subject: [Nagios-users] Passive tests and notifications > > Yesterday I configured a couple of passive tests on one of my servers, > disk space and the mysql process, to be specific. Later that night I > logged on to the status screen to check things and saw that the passive > tests were marked "OK", but they hadn't been updated for several hours > even though the cronjob I set up was every 30 minutes. I obviously made > some kind of mistake that I will deal with on monday (can't do anything > right now on the remote server because I recently changed the root > password and can't remember what it is, :) so have to wait until I'm in > the office and can look it up). > > What bothers me is I never received a notification. How do I get nagios > to send out a notification or fall back to an active test if the > information is stale? Use freshness checking. http://nagios.sourceforge.net/docs/1_0/freshness.html Marc ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Sun Sep 11 17:12:02 2005 From: marc at ena.com (Marc Powell) Date: Sun, 11 Sep 2005 10:12:02 -0500 Subject: oscp command design and FIFO locking? Message-ID: > -----Original Message----- > From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users- > admin at lists.sourceforge.net] On Behalf Of Fred > Sent: Sunday, September 11, 2005 9:12 AM > To: Nagios User > Subject: [Nagios-users] oscp command design and FIFO locking? > > > Does anyone have an idea why the oscp command (for distributed monitoring) > would > kick off more then one command at a time? For example, if there are a > number > of checks that are completed, nagios kicks off multiple oscp scripts > (submit > commands). Since the OCSP command can be and do anything, it must be run once per check. Nagios can't predict what you're using the OCSP command for and whether batching, as you seem to desire, would be applicable. Distributed monitoring is just one application of OCSP. If you really want the batching behavior, build it into your OCSP command. > This causes the design of the submit command to need to throttle the > access > to whatever resources it might need to touch. If using the default > send_nsca > command, there can now be multiple (and many multiple) send_nsca's kicked > off > and each of these on the target server will all be attempting to write to > the nagios FIFO. The nagios FIFO can get horribly overloaded. If the > nagios > master demon is not aggresively reading the FIFO (check_command_interval=- > 1) > then the demons can stack up and eventually consume socket resources and I handle approximately 3300 passive checks every 5 minutes on somewhat commodity hardware (quad pIII 800) using NSCA with no problems. I anticipate that I can double and possibly triple that number as the FIFO is empty approximately 1/3 of the time. Are you doing significantly more passive checks than that? > memory etc. As far as I can tell, nsca doesn't lock the FIFO, which also > means that writes will get intermixed with writes from plug-ins that might > be > running on the master system. (I have seen this over and over) I don't see how. Local active checks, at least the standard plugins, don't use nagios.cmd in any way. This would also be contrary to the blocking behavior you comment on above where your OS is essentially 'locking' the FIFO until it has been cleared. As far as your OS is concerned, there is no distinction between NSCA trying to write to the pipe and some other process doing the same. While others are more versed in this than I am, it is my understanding that if the program is trying to write more data to the pipe than it can currently hold it will be prevented from doing so by the OS, only one process can write to the FIFO at a time and that all writes are atomic. This presumes that the plugin output is < the max FIFO length supported by your OS. > > To avoid this, I have had to implement serious locking in all plug-ins and > not use nsca as it has no locking mechanism (that I know of). I'm curious about how you've done this. What exactly are you locking? How is it helping? NSCA shouldn't need locking as it depends on your OS to control access to the FIFO. > Right now I am fighting with the oscp commands that can launch dozens of > copies at a time and each of these (in my case) write to a local file that > will eventually be pushed up to the master and written (while locking) the > nagios FIFO. > > So ... I guess my questions are: > > 1) Should nagios be forking off more then one oscp command at a time? Yes, one per check. > 2) Has anyone else run into FIFO corruption because of the lack of > advisory > locking in all the plug-ins? Not here in almost 4 years of using Nagios/Netsaint. -- Marc ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From f1216 at yahoo.com Sun Sep 11 18:37:20 2005 From: f1216 at yahoo.com (Fred) Date: Sun, 11 Sep 2005 09:37:20 -0700 (PDT) Subject: oscp command design and FIFO locking? In-Reply-To: References: Message-ID: <20050911163720.11118.qmail@web31902.mail.mud.yahoo.com> Marc, Thanks for the detailed reply. I've attempted to be a bit more clear in the comments below. --- Marc Powell wrote: > > > > -----Original Message----- > > From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users- > > admin at lists.sourceforge.net] On Behalf Of Fred > > Sent: Sunday, September 11, 2005 9:12 AM > > To: Nagios User > > Subject: [Nagios-users] oscp command design and FIFO locking? > > > > > > Does anyone have an idea why the oscp command (for distributed > monitoring) > > would > > kick off more then one command at a time? For example, if there are a > > number > > of checks that are completed, nagios kicks off multiple oscp scripts > > (submit > > commands). > > Since the OCSP command can be and do anything, it must be run once per > check. Nagios can't predict what you're using the OCSP command for and > whether batching, as you seem to desire, would be applicable. > Distributed monitoring is just one application of OCSP. If you really > want the batching behavior, build it into your OCSP command. > It was my impression that this command was intended for distributed monitoring and that other hooks exist to provide control to other types of commands for other purposes. > > This causes the design of the submit command to need to throttle the > > access > > to whatever resources it might need to touch. If using the default > > send_nsca > > command, there can now be multiple (and many multiple) send_nsca's > kicked > > off > > and each of these on the target server will all be attempting to write > to > > the nagios FIFO. The nagios FIFO can get horribly overloaded. If the > > nagios > > master demon is not aggresively reading the FIFO > (check_command_interval=- > > 1) > > then the demons can stack up and eventually consume socket resources > and > > I handle approximately 3300 passive checks every 5 minutes on somewhat > commodity hardware (quad pIII 800) using NSCA with no problems. I > anticipate that I can double and possibly triple that number as the FIFO > is empty approximately 1/3 of the time. Are you doing significantly more > passive checks than that? > Most likely ... on one installation I have over 1040 nodes, over 10,500 checks, 99% of which are passive and involve plug-ins which write to the nagios.cmd FIFO. Each compute node defines 10 passive service check definitions, each service node defines an additional 10 active checks. The nsca demon forks children to write to nagios.cmd as a result of a send_nsca connection request. If at the same time, some plug-in tries to write to this file, there is a good chance that the buffers can be interspersed if both the nsca process and the plug-in do not observe any kind of lock mechanism. This can also occur when nagios forks off multiple service check plug-ins that both want to write to the FIFO. It took a system configuration of about 120 or so nodes for this to start happening for me. It wasn't consistent and it isn't fatal. If you looked closely, the nagios.log would report an invalid command and then read the next line of the FIFO and move on, however, the data from that line would be lost. Since implementing a lock around writing to the FIFO from all my plug-ins, this has not occurred. Note, in my smaller configurations, I don't use nsca as there is no distributed monitoring. The contention in these smaller systems is between concurrently running plug-ins. > > memory etc. As far as I can tell, nsca doesn't lock the FIFO, which > also > > means that writes will get intermixed with writes from plug-ins that > might > > be > > running on the master system. (I have seen this over and over) > > I don't see how. Local active checks, at least the standard plugins, > don't use nagios.cmd in any way. This would also be contrary to the > blocking behavior you comment on above where your OS is essentially > 'locking' the FIFO until it has been cleared. As far as your OS is > concerned, there is no distinction between NSCA trying to write to the > pipe and some other process doing the same. While others are more versed > in this than I am, it is my understanding that if the program is trying > to write more data to the pipe than it can currently hold it will be > prevented from doing so by the OS, only one process can write to the > FIFO at a time and that all writes are atomic. This presumes that the > plugin output is < the max FIFO length supported by your OS. I use few local active checks. Those that I do use, typically are kicked off to generate per-node data that is written to the nagios.cmd FIFO, one line item for each node. With the FIFO on a 4k block filesystem, that isn't too much room before it fills. At about 80-120 chars per message, it only takes 30-50 messages to fill the FIFO then the plug-in is blocked waiting for nagios to read it. If nagios only reads it every 15 seconds, it could easily take over a minute to read 128 messages (128 nodes). More then one process can write to a FIFO at a time, it is just a unix file opened for append. The OS doesn't control this, the user application has to. It gets worse ... if nagios spins off more then one plug-in that in turn writes to the FIFO, and each of those want to write say 128 lines of data, they can easily toast each other. Nagios does have a setting to keep the number of concurrent processes to 1, but that seems to be too big a hammer for this problem. In any case, locking between plug-ins (and wrapping any existing ones with locks) works well. I also set my nagios demon to aggresively read from the FIFO, otherwise things start timing out (with a service check timeout at say 60-120 seconds) While I have few local checks, they are the core of my monitoring system as they are resposnible for filling in all the per-node information for the majority of the passive checks, for example, I have a syslog monitor plugin that runs and parses the recent syslog messages, compares against interesting patterns, and then formats a line for each node that has something interesting and writes that to the FIFO, for those nodes that do not have any interesting content, it formats a line that says nothing matched (if I didn't do that, the service check would never fill any data in or it would go stale) Other plug-ins report per-node statistics and format this into the FIFO. Each node has passive check definitions for these results. > > > > > To avoid this, I have had to implement serious locking in all plug-ins > and > > not use nsca as it has no locking mechanism (that I know of). > > I'm curious about how you've done this. What exactly are you locking? > How is it helping? NSCA shouldn't need locking as it depends on your OS > to control access to the FIFO. > > > Right now I am fighting with the oscp commands that can launch dozens > of > > copies at a time and each of these (in my case) write to a local file > that > > will eventually be pushed up to the master and written (while locking) > the > > nagios FIFO. > > > > So ... I guess my questions are: > > > > 1) Should nagios be forking off more then one oscp command at a time? > > Yes, one per check. > > > 2) Has anyone else run into FIFO corruption because of the lack of > > advisory > > locking in all the plug-ins? > > Not here in almost 4 years of using Nagios/Netsaint. Again, thanks for the input. > > -- > Marc > > > ------------------------------------------------------- > SF.Net email is Sponsored by the Better Software Conference & EXPO > September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices > Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA > Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting > any issue. > ::: Messages without supporting info will risk being sent to /dev/null > ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From g.vickers at qut.edu.au Mon Sep 12 02:30:25 2005 From: g.vickers at qut.edu.au (Greg Vickers) Date: Mon, 12 Sep 2005 10:30:25 +1000 Subject: suggestions for upgrading Nagios In-Reply-To: <001501c5b578$0aef8720$8400a8c0@sshi.local> References: <001501c5b578$0aef8720$8400a8c0@sshi.local> Message-ID: <4324CC21.1080204@qut.edu.au> Matt, Matthew Agle wrote: > Hello, > > I have currently running Nagios version 1.2 and looking to > upgrade to 2.04b. Has anyone done this and/or performed a upgrade and > if so what suggestions/tips would you have? Is it easier to upgrade or > simply install to a different location and point the config files > there? Thanks in advance for any feedback! RTFM is your friend. I would very carefully read what is new in version 2.0, in the "What's new in this version" section of the online manual. http://www.nagios.org, Support, Online Documentation, v2.x HTML, Table of Contents, What's new in this version.... If I were you, I would pay special attention to the Hostgroup changes. Take a copy of your 1.x config and run the 2.x binary against it, see what is broken, that will give you a good starting point. HTH, [p.s.] OK, ok, the only change you *have* to make is moving the new location of the contact_groups directive... but there's so much other new and better stuff in v2.x, have a look and see what is applicable to your situation. -- Greg Vickers Project Manager, IT Security Information Technology Services Queensland University of Technology L12, 126 Margaret St, Brisbane Phone: (07) 3864 9536 Email: g.vickers at qut.edu.au IT Security web site: http://www.its.qut.edu.au/itsecurity/ CRICOS No. 00213J ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From f1216 at yahoo.com Mon Sep 12 03:29:29 2005 From: f1216 at yahoo.com (Fred) Date: Sun, 11 Sep 2005 18:29:29 -0700 (PDT) Subject: suggestions for upgrading Nagios In-Reply-To: <4324CC21.1080204@qut.edu.au> References: <4324CC21.1080204@qut.edu.au> Message-ID: <20050912012929.13978.qmail@web31906.mail.mud.yahoo.com> I believe there are a number of changed configuration variables that will be obvious when you try to start a 1.2 config under 2.0b*, I know there were a number of syntax errors when I upgraded. -FredC --- Greg Vickers wrote: > Matt, > > Matthew Agle wrote: > > Hello, > > > > I have currently running Nagios version 1.2 and looking to > > upgrade to 2.04b. Has anyone done this and/or performed a upgrade and > > if so what suggestions/tips would you have? Is it easier to upgrade or > > simply install to a different location and point the config files > > there? Thanks in advance for any feedback! > > RTFM is your friend. I would very carefully read what is new in version > 2.0, in the "What's new in this version" section of the online manual. > > http://www.nagios.org, Support, Online Documentation, v2.x HTML, Table > of Contents, What's new in this version.... > > If I were you, I would pay special attention to the Hostgroup changes. > > Take a copy of your 1.x config and run the 2.x binary against it, see > what is broken, that will give you a good starting point. > > HTH, > > [p.s.] OK, ok, the only change you *have* to make is moving the new > location of the contact_groups directive... but there's so much other > new and better stuff in v2.x, have a look and see what is applicable to > your situation. > > -- > Greg Vickers > Project Manager, IT Security > Information Technology Services > Queensland University of Technology > L12, 126 Margaret St, Brisbane > > Phone: (07) 3864 9536 > Email: g.vickers at qut.edu.au > IT Security web site: http://www.its.qut.edu.au/itsecurity/ > > CRICOS No. 00213J > > > ------------------------------------------------------- > SF.Net email is Sponsored by the Better Software Conference & EXPO > September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices > Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA > Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting > any issue. > ::: Messages without supporting info will risk being sent to /dev/null > ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ae at op5.se Mon Sep 12 08:53:28 2005 From: ae at op5.se (Andreas Ericsson) Date: Mon, 12 Sep 2005 08:53:28 +0200 Subject: suggestions for upgrading Nagios In-Reply-To: <20050912012929.13978.qmail@web31906.mail.mud.yahoo.com> References: <20050912012929.13978.qmail@web31906.mail.mud.yahoo.com> Message-ID: <432525E8.7080001@op5.se> Fred wrote: > I believe there are a number of changed configuration variables that > will be obvious when you try to start a 1.2 config under 2.0b*, I know > there were a number of syntax errors when I upgraded. > Don't forget the macros in the notification commands, or we'll be answering that particular question for the 16033rd time tomorrow. > -FredC > > --- Greg Vickers wrote: > > >>Matt, >> >>Matthew Agle wrote: >> >>>Hello, >>> >>> I have currently running Nagios version 1.2 and looking to >>>upgrade to 2.04b. Has anyone done this and/or performed a upgrade and >>>if so what suggestions/tips would you have? Is it easier to upgrade or >>>simply install to a different location and point the config files >>>there? Thanks in advance for any feedback! >> >>RTFM is your friend. I would very carefully read what is new in version >>2.0, in the "What's new in this version" section of the online manual. >> >>http://www.nagios.org, Support, Online Documentation, v2.x HTML, Table >>of Contents, What's new in this version.... >> >>If I were you, I would pay special attention to the Hostgroup changes. >> >>Take a copy of your 1.x config and run the 2.x binary against it, see >>what is broken, that will give you a good starting point. >> >>HTH, >> >>[p.s.] OK, ok, the only change you *have* to make is moving the new >>location of the contact_groups directive... but there's so much other >>new and better stuff in v2.x, have a look and see what is applicable to >>your situation. >> >>-- >>Greg Vickers >>Project Manager, IT Security >>Information Technology Services >>Queensland University of Technology >>L12, 126 Margaret St, Brisbane >> >>Phone: (07) 3864 9536 >>Email: g.vickers at qut.edu.au >>IT Security web site: http://www.its.qut.edu.au/itsecurity/ >> >>CRICOS No. 00213J >> >> >>------------------------------------------------------- >>SF.Net email is Sponsored by the Better Software Conference & EXPO >>September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices >>Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA >>Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf >>_______________________________________________ >>Nagios-users mailing list >>Nagios-users at lists.sourceforge.net >>https://lists.sourceforge.net/lists/listinfo/nagios-users >>::: Please include Nagios version, plugin version (-v) and OS when reporting >>any issue. >>::: Messages without supporting info will risk being sent to /dev/null >> > > > > > > > > > ------------------------------------------------------- > SF.Net email is Sponsored by the Better Software Conference & EXPO > September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices > Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA > Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Lead Developer ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ae at op5.se Mon Sep 12 09:16:05 2005 From: ae at op5.se (Andreas Ericsson) Date: Mon, 12 Sep 2005 09:16:05 +0200 Subject: oscp command design and FIFO locking? In-Reply-To: <20050911163720.11118.qmail@web31902.mail.mud.yahoo.com> References: <20050911163720.11118.qmail@web31902.mail.mud.yahoo.com> Message-ID: <43252B35.4080001@op5.se> Fred wrote: >>>This causes the design of the submit command to need to throttle the >>>access >>>to whatever resources it might need to touch. If using the default >>>send_nsca >>>command, there can now be multiple (and many multiple) send_nsca's >> >>kicked >> >>>off >>>and each of these on the target server will all be attempting to write >> >>to >> >>>the nagios FIFO. The nagios FIFO can get horribly overloaded. If the >>>nagios >>>master demon is not aggresively reading the FIFO >> >>(check_command_interval=- >> >>>1) >>>then the demons can stack up and eventually consume socket resources >> >>and >> >>I handle approximately 3300 passive checks every 5 minutes on somewhat >>commodity hardware (quad pIII 800) using NSCA with no problems. I >>anticipate that I can double and possibly triple that number as the FIFO >>is empty approximately 1/3 of the time. Are you doing significantly more >>passive checks than that? >> > > > Most likely ... on one installation I have over 1040 nodes, over 10,500 > checks, 99% of which are passive and involve plug-ins which write to the > nagios.cmd FIFO. Each compute node defines 10 passive service check > definitions, each service node defines an additional 10 active checks. > > The nsca demon forks children to write to nagios.cmd as > a result of a send_nsca connection request. If at the same time, some plug-in > tries to write to this file, there is a good chance that the buffers can > be interspersed if both the nsca process and the plug-in do not observe any > kind of lock mechanism. This can also occur when nagios forks off multiple > service check plug-ins that both want to write to the FIFO. It took a system > configuration of about 120 or > so nodes for this to start happening for me. It wasn't consistent and it > isn't fatal. If you looked closely, the nagios.log would report an invalid > command and then read the next line of the FIFO and move on, however, the > data from that line would be lost. Since implementing a lock around writing > to the FIFO from all my plug-ins, this has not occurred. Note, in my smaller > configurations, I don't use nsca as there is no distributed monitoring. The > contention in these smaller systems is between concurrently running plug-ins. > If you read the code you'll notice that the active checks also write their service results to the FIFO. This is a showstopper on the road to "scale like hell", so a few various other methods are being tested. Multiplexing several children from a single parent seems the way to go. 509 checks can run smoothly at once on a modern system (round about 1017 if you don't let the child have an stderr). The limit is set by sysconf(_SC_OPEN_MAX) / 2, or sysconf(_SC_CHILD_MAX), whichever is lowest. > >>>memory etc. As far as I can tell, nsca doesn't lock the FIFO, which >> >>also >> >>>means that writes will get intermixed with writes from plug-ins that >> >>might >> >>>be >>>running on the master system. (I have seen this over and over) >> >>I don't see how. Local active checks, at least the standard plugins, >>don't use nagios.cmd in any way. This is incorrect. See above. > This would also be contrary to the >>blocking behavior you comment on above where your OS is essentially >>'locking' the FIFO until it has been cleared. As far as your OS is >>concerned, there is no distinction between NSCA trying to write to the >>pipe and some other process doing the same. While others are more versed >>in this than I am, it is my understanding that if the program is trying >>to write more data to the pipe than it can currently hold it will be >>prevented from doing so by the OS, only one process can write to the >>FIFO at a time and that all writes are atomic. This presumes that the >>plugin output is < the max FIFO length supported by your OS. > Actually, the write(2) command will write some data, but not all. The smallest guaranteed atomic write size is 512 on posix systems. Obviously, this is larger on most, but it can't be infinite so all writes aren't atomic. > > I use few local active checks. Those that I do use, typically are kicked > off to generate per-node data that is written to the nagios.cmd FIFO, one > line item for each node. With the FIFO on a 4k block filesystem, that isn't > too much room before it fills. At about 80-120 chars per message, it only > takes 30-50 messages > to fill the FIFO then the plug-in is blocked waiting for nagios to read it. > If nagios only reads it every 15 seconds, it could easily take over a minute > to read 128 messages (128 nodes). So set service_result_reaper_frequency (or some such) to 2. Having it at 15 in a large environment just won't work. > More then one process can write to a FIFO > at a time, it is just a unix file opened for append. The OS doesn't control > this, the user application has to. It gets worse ... if nagios spins off > more then one plug-in that in turn writes to the FIFO, and each of those > want to write say 128 lines of data, they can easily toast each other. Nagios > does have a setting to keep the number of concurrent processes to 1, but that > seems to be too big a hammer for this problem. In any case, locking between > plug-ins (and wrapping any existing ones with locks) works well. I also set > my nagios demon to aggresively read from the FIFO, otherwise things start > timing out (with a service check timeout at say 60-120 seconds) > > While I have few local checks, they are the core of my monitoring system as > they are resposnible for filling in all the per-node information for the > majority of the passive checks, for example, I have a syslog monitor plugin > that runs and parses the recent syslog messages, compares against interesting > patterns, and then formats a line for each node that has something interesting > and writes that to the FIFO, for those nodes that do not have any interesting > content, it formats a line that says nothing matched (if I didn't do that, the > service check would never fill any data in or it would go stale) Other > plug-ins report per-node statistics and format this into the FIFO. Each node > has passive check definitions for these results. > > >>>To avoid this, I have had to implement serious locking in all plug-ins >> >>and >> >>>not use nsca as it has no locking mechanism (that I know of). >> A better solution would have been to implement a local UDP socket mechanism. The reaper in Nagios can easily multiplex, and the receive buffers on sockets can be dynamically increased from the program creating it (up to at least 65536 bytes even on very old linuxes). >>I'm curious about how you've done this. What exactly are you locking? >>How is it helping? NSCA shouldn't need locking as it depends on your OS >>to control access to the FIFO. >> >> >>>Right now I am fighting with the oscp commands that can launch dozens >> >>of >> >>>copies at a time and each of these (in my case) write to a local file >> >>that >> >>>will eventually be pushed up to the master and written (while locking) >> >>the >> >>>nagios FIFO. >>> >>>So ... I guess my questions are: >>> >>>1) Should nagios be forking off more then one oscp command at a time? >> >>Yes, one per check. >> >> >>>2) Has anyone else run into FIFO corruption because of the lack of >>>advisory >>> locking in all the plug-ins? >> This is quite a misplaced question. The plugins just write to a file-descriptor they think is stdout, but is really a pipe opened by nagios (using the pipe(2) syscall) specifically for that plugin. That pipe doesn't get filled as only one plugin is writing to it. It's nagios itself that writes to its own FIFO. >>Not here in almost 4 years of using Nagios/Netsaint. > > > Again, thanks for the input. > > >>-- >>Marc >> >> >>------------------------------------------------------- >>SF.Net email is Sponsored by the Better Software Conference & EXPO >>September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices >>Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA >>Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf >>_______________________________________________ >>Nagios-users mailing list >>Nagios-users at lists.sourceforge.net >>https://lists.sourceforge.net/lists/listinfo/nagios-users >>::: Please include Nagios version, plugin version (-v) and OS when reporting >>any issue. >>::: Messages without supporting info will risk being sent to /dev/null >> > > > > > > > > > ------------------------------------------------------- > SF.Net email is Sponsored by the Better Software Conference & EXPO > September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices > Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA > Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Lead Developer ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jjk_saji at yahoo.com Mon Sep 12 09:45:44 2005 From: jjk_saji at yahoo.com (John Joseph) Date: Mon, 12 Sep 2005 08:45:44 +0100 (BST) Subject: No notifications in "nagios.log" , Message-ID: <20050912074544.38615.qmail@web40822.mail.yahoo.com> Hi When I was trouble shooting for the not receiving email -notification , I found that my ? usr/local/nagios/var/nagios.log? do not have any notification in it , while it has other details such as history , status , trends I think if I am able to find out the reason , why there is no info of notification in nagios.log , I can solve my mail problem Help requested Thanks Joseph John Send instant messages to your online friends http://uk.messenger.yahoo.com ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From oliver.steenbuck at lhsystems.com Mon Sep 12 13:42:09 2005 From: oliver.steenbuck at lhsystems.com (oliver.steenbuck at lhsystems.com) Date: Mon, 12 Sep 2005 13:42:09 +0200 Subject: Scheduling (ordering) of Service checks Message-ID: <79BF0B53C4A28446A097D6170CF4C69955ED0C@xw2k3-hammbx-03.ads.dlh.de> My situation is roughy as follows: I need to check some (html) applications. This application checking needs different steps (e.g. Login, check_1, check_2, Logout). Obviously the "logout" step should only happen after, the "check" steps have returned. Currently every "step" is implemented as a service. Is it possible to order the execution of different serivce checks in a certain way. (Not asfar as I can see but I may ofcourse be mistaken) We are running Nagios 1.2 If the answer is no, has somebody already done meaningfull work in the direction of adding such a facillity to nagios. I would be soo much interested in any clues or insights. Kind Regards Oliver Steenbuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From f1216 at yahoo.com Mon Sep 12 13:53:53 2005 From: f1216 at yahoo.com (Fred) Date: Mon, 12 Sep 2005 04:53:53 -0700 (PDT) Subject: oscp command design and FIFO locking? In-Reply-To: <43252B35.4080001@op5.se> References: <43252B35.4080001@op5.se> Message-ID: <20050912115353.64819.qmail@web31912.mail.mud.yahoo.com> Andreas, Thank you for the comments. This mail thread is getting visually ugly because of the word wraps, so I'm not going to comment inline. A few more questions/clarifications follow: SC_OPEN_MAX is probably what I am hitting, it is configured to be 1024 on our system. This could explain quite a bit. To be clear, the issues that I originally described about writing to the nagios.cmd fifo were not related in any way to the directly launched plugins from nagios, i.e, I have no doubt that nagios does the right thing internally to insure consistency. What nagios has no control over is essentially async processes that are writing to the nagios.cmd fifo with the intent of providing passive check input to nagios. i.e., echo "a bunch of lines of passive-check-results ... " >>nagios.cmd while nagios is running (especially if nagios is also writing its own active check results here!) could cause lots of trouble if there are no observed locks. The above is essentially what happens in my system (where the echo is a really a set of perl scripts that all take turns writing the fifo) This was the reasons behind my question #2 about FIFO corruption. Again, thank you for the SC_OPEN_MAX pointer ..., I think what may have caused my problems may have been a recent addition of host-checks, this will cause more open descriptors that previously used and may have pushed things over the edge. -FredC --- Andreas Ericsson wrote: > Fred wrote: > >>>This causes the design of the submit command to need to throttle the > >>>access > >>>to whatever resources it might need to touch. If using the default > >>>send_nsca > >>>command, there can now be multiple (and many multiple) send_nsca's > >> > >>kicked > >> > >>>off > >>>and each of these on the target server will all be attempting to write > >> > >>to > >> > >>>the nagios FIFO. The nagios FIFO can get horribly overloaded. If the > >>>nagios > >>>master demon is not aggresively reading the FIFO > >> > >>(check_command_interval=- > >> > >>>1) > >>>then the demons can stack up and eventually consume socket resources > >> > >>and > >> > >>I handle approximately 3300 passive checks every 5 minutes on somewhat > >>commodity hardware (quad pIII 800) using NSCA with no problems. I > >>anticipate that I can double and possibly triple that number as the FIFO > >>is empty approximately 1/3 of the time. Are you doing significantly more > >>passive checks than that? > >> > > > > > > Most likely ... on one installation I have over 1040 nodes, over 10,500 > > checks, 99% of which are passive and involve plug-ins which write to the > > nagios.cmd FIFO. Each compute node defines 10 passive service check > > definitions, each service node defines an additional 10 active checks. > > > > The nsca demon forks children to write to nagios.cmd as > > a result of a send_nsca connection request. If at the same time, some > plug-in > > tries to write to this file, there is a good chance that the buffers can > > be interspersed if both the nsca process and the plug-in do not observe any > > kind of lock mechanism. This can also occur when nagios forks off > multiple > > service check plug-ins that both want to write to the FIFO. It took a > system > > configuration of about 120 or > > so nodes for this to start happening for me. It wasn't consistent and it > > isn't fatal. If you looked closely, the nagios.log would report an invalid > > command and then read the next line of the FIFO and move on, however, the > > data from that line would be lost. Since implementing a lock around > writing > > to the FIFO from all my plug-ins, this has not occurred. Note, in my > smaller > > configurations, I don't use nsca as there is no distributed monitoring. > The > > contention in these smaller systems is between concurrently running > plug-ins. > > > > If you read the code you'll notice that the active checks also write > their service results to the FIFO. This is a showstopper on the road to > "scale like hell", so a few various other methods are being tested. > Multiplexing several children from a single parent seems the way to go. > 509 checks can run smoothly at once on a modern system (round about 1017 > if you don't let the child have an stderr). The limit is set by > sysconf(_SC_OPEN_MAX) / 2, or sysconf(_SC_CHILD_MAX), whichever is lowest. > > > > >>>memory etc. As far as I can tell, nsca doesn't lock the FIFO, which > >> > >>also > >> > >>>means that writes will get intermixed with writes from plug-ins that > >> > >>might > >> > >>>be > >>>running on the master system. (I have seen this over and over) > >> > >>I don't see how. Local active checks, at least the standard plugins, > >>don't use nagios.cmd in any way. > > > This is incorrect. See above. > > > > This would also be contrary to the > >>blocking behavior you comment on above where your OS is essentially > >>'locking' the FIFO until it has been cleared. As far as your OS is > >>concerned, there is no distinction between NSCA trying to write to the > >>pipe and some other process doing the same. While others are more versed > >>in this than I am, it is my understanding that if the program is trying > >>to write more data to the pipe than it can currently hold it will be > >>prevented from doing so by the OS, only one process can write to the > >>FIFO at a time and that all writes are atomic. This presumes that the > >>plugin output is < the max FIFO length supported by your OS. > > > > Actually, the write(2) command will write some data, but not all. The > smallest guaranteed atomic write size is 512 on posix systems. > Obviously, this is larger on most, but it can't be infinite so all > writes aren't atomic. > > > > > > I use few local active checks. Those that I do use, typically are kicked > > off to generate per-node data that is written to the nagios.cmd FIFO, one > > line item for each node. With the FIFO on a 4k block filesystem, that > isn't > > too much room before it fills. At about 80-120 chars per message, it only > > takes 30-50 messages > > to fill the FIFO then the plug-in is blocked waiting for nagios to read it. > > If nagios only reads it every 15 seconds, it could easily take over a > minute > > to read 128 messages (128 nodes). > > > So set service_result_reaper_frequency (or some such) to 2. Having it at > 15 in a large environment just won't work. > > > > More then one process can write to a FIFO > > at a time, it is just a unix file opened for append. The OS doesn't > control > > this, the user application has to. It gets worse ... if nagios spins off > > more then one plug-in that in turn writes to the FIFO, and each of those > > want to write say 128 lines of data, they can easily toast each other. > Nagios > > does have a setting to keep the number of concurrent processes to 1, but > that > > seems to be too big a hammer for this problem. In any case, locking > between > > plug-ins (and wrapping any existing ones with locks) works well. I also > set > > my nagios demon to aggresively read from the FIFO, otherwise things start > > timing out (with a service check timeout at say 60-120 seconds) > > > > While I have few local checks, they are the core of my monitoring system as > > they are resposnible for filling in all the per-node information for the > > majority of the passive checks, for example, I have a syslog monitor plugin > > that runs and parses the recent syslog messages, compares against > interesting > > patterns, and then formats a line for each node that has something > interesting > > and writes that to the FIFO, for those nodes that do not have any > interesting > > content, it formats a line that says nothing matched (if I didn't do that, > the > > service check would never fill any data in or it would go stale) Other > > plug-ins report per-node statistics and format this into the FIFO. Each > node > > has passive check definitions for these results. > > > > > >>>To avoid this, I have had to implement serious locking in all plug-ins > >> > >>and > >> > >>>not use nsca as it has no locking mechanism (that I know of). > >> > > A better solution would have been to implement a local UDP socket > mechanism. The reaper in Nagios can easily multiplex, and the receive > buffers on sockets can be dynamically increased from the program > creating it (up to at least 65536 bytes even on very old linuxes). > > > >>I'm curious about how you've done this. What exactly are you locking? > >>How is it helping? NSCA shouldn't need locking as it depends on your OS > >>to control access to the FIFO. > >> > >> > >>>Right now I am fighting with the oscp commands that can launch dozens > >> > >>of > >> > >>>copies at a time and each of these (in my case) write to a local file > >> > >>that > >> > >>>will eventually be pushed up to the master and written (while locking) > >> > >>the > >> > >>>nagios FIFO. > >>> > >>>So ... I guess my questions are: > >>> > >>>1) Should nagios be forking off more then one oscp command at a time? > >> > >>Yes, one per check. > >> > >> > >>>2) Has anyone else run into FIFO corruption because of the lack of > >>>advisory > >>> locking in all the plug-ins? > >> > > This is quite a misplaced question. The plugins just write to a > file-descriptor they think is stdout, but is really a pipe opened by > nagios (using the pipe(2) syscall) specifically for that plugin. That > pipe doesn't get filled as only one plugin is writing to it. It's nagios > itself that writes to its own FIFO. > > > >>Not here in almost 4 years of using Nagios/Netsaint. > > > > > > Again, thanks for the input. > > > > > >>-- > >>Marc > >> > >> > >>------------------------------------------------------- > >>SF.Net email is Sponsored by the Better Software Conference & EXPO > >>September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices > >>Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA > >>Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf > >>_______________________________________________ > >>Nagios-users mailing list > >>Nagios-users at lists.sourceforge.net > >>https://lists.sourceforge.net/lists/listinfo/nagios-users > >>::: Please include Nagios version, plugin version (-v) and OS when > reporting > >>any issue. > >>::: Messages without supporting info will risk being sent to /dev/null > >> > > > > > > > > > > > > > > > > > > ------------------------------------------------------- > > SF.Net email is Sponsored by the Better Software Conference & EXPO > > September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices > > Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA > > Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf > > _______________________________________________ > > Nagios-users mailing list > > Nagios-users at lists.sourceforge.net > > https://lists.sourceforge.net/lists/listinfo/nagios-users > > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > > ::: Messages without supporting info will risk being sent to /dev/null > > > > -- > Andreas Ericsson andreas.ericsson at op5.se > OP5 AB www.op5.se > Lead Developer > > > ------------------------------------------------------- > SF.Net email is Sponsored by the Better Software Conference & EXPO > September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices > Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA > Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting > any issue. > ::: Messages without supporting info will risk being sent to /dev/null > ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From f1216 at yahoo.com Mon Sep 12 14:23:13 2005 From: f1216 at yahoo.com (Fred) Date: Mon, 12 Sep 2005 05:23:13 -0700 (PDT) Subject: Scheduling (ordering) of Service checks In-Reply-To: <79BF0B53C4A28446A097D6170CF4C69955ED0C@xw2k3-hammbx-03.ads.dlh.de> References: <79BF0B53C4A28446A097D6170CF4C69955ED0C@xw2k3-hammbx-03.ads.dlh.de> Message-ID: <20050912122313.30041.qmail@web31902.mail.mud.yahoo.com> Oliver, If I were faced with this type of a problem, I would probably think about a bit differently. Nagios is good at launching checks at intervals and expects a discrete amount of work in a finite time to be accomplished. It is very good for checking and returning status. Given that, you might consider having a plug-in that oversees and reports on the progress of the outstanding phases. Its also not clear if your html application is something initiated from the nagios interface (for example, you include a link in your plug-in output and start it from there) or if this is just some activity that is happening async to nagios. It is also not clear if you would need to report a single instance of this sequence or if there can be any number of login/logout's. -FredC --- oliver.steenbuck at lhsystems.com wrote: > My situation is roughy as follows: > > > I need to check some (html) applications. This application checking needs > different steps (e.g. Login, check_1, check_2, Logout). Obviously the > "logout" step should only happen after, the "check" steps have returned. > Currently every "step" is implemented as a service. > Is it possible to order the execution of different serivce checks in a > certain way. (Not asfar as I can see but I may ofcourse be mistaken) > We are running Nagios 1.2 > > If the answer is no, has somebody already done meaningfull work in the > direction of adding such a facillity to nagios. I would be soo much > interested in any clues or insights. > > Kind Regards > Oliver Steenbuck > ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From f1216 at yahoo.com Mon Sep 12 15:54:46 2005 From: f1216 at yahoo.com (Fred) Date: Mon, 12 Sep 2005 06:54:46 -0700 (PDT) Subject: Service check delays in distributed monitor setup In-Reply-To: <20050908012059.95096.qmail@web31906.mail.mud.yahoo.com> References: <20050908012059.95096.qmail@web31906.mail.mud.yahoo.com> Message-ID: <20050912135446.65693.qmail@web31901.mail.mud.yahoo.com> I believe I have found the source of my issue around service check delays in the distributed monitoring setup. Many thanks to Andreas Ericsson for reminding me about socket and child resource requirements ... If you read the code you'll notice that the active checks also write their service results to the FIFO. This is a showstopper on the road to "scale like hell", so a few various other methods are being tested. Multiplexing several children from a single parent seems the way to go. 509 checks can run smoothly at once on a modern system (round about 1017 if you don't let the child have an stderr). The limit is set by sysconf(_SC_OPEN_MAX) / 2, or sysconf(_SC_CHILD_MAX), whichever is lowest. My file open ulimit for nagios was at 1024, the default. By doing a ulimit -n 8192 in my nagios service startup script, all things came back to normal ... services started scheduling, processes stopped hanging, it was a beautiful thing ;-) The interesting observation from this is that there seemed to be no failure messages from nagios around not being able to fork child processes or resource type failure messages in any logs. I suspect some limits where being crossed but nothing was reporting it. Thanks to all who responded. -FredC --- Fred wrote: > Unfortunately, setting the increment to a small number only worked to > set the pending state to something that looked reasonable, however, the > services still never get scheduled. > > My configuration *was* working at one point, I tweaked something and > now no matter what I do, I can't get it to start monitoring again. My > passive checks recieved from other monitor nodes all seem to get registered, > its just the active checks that run on the master (head) node never see > the light of day any more. If I regenerate the configuration to not use > distributed monitoring, it works just fine, however, that puts way too much > pressure on a single node. I removed the status.sav, but as I type > this I'm thinking I should nuke all the cache files that nagios builds, maybe > there is something that got munged in there ... > > We've used both Nagios 1.2 and now 2.0b3 (testing 2.0b4) and I have yet > to need to crack open the source and make any mods ... looks like that time > is coming ;-) > > -FredC > > --- misc at viceconsulting.co.nz wrote: > > > Hi Fred, > > > > I have encountered the exact same problem with my central Nagios server. > > It has about 1000 passive services, but only about 10 active services (the > > active services being used for the central Nagios server to self-monitor > > itself). The 1000 passive services receiving their results from the 5 > > distributed servers. > > > > When I restart the Central Nagios server, the active checks get scheduled > > for 3 hours+ into the future, but they never actually seem to run. For > > days the active checks have not actually been checking themselves. > > > > I tried changing the service_inter_check_delay_method to d for dumb, which > > appeared to schedule it when I expected (ie within about 5 mins after the > > restart) but it still didn't run them. > > > > Your idea of setting service_inter_check_delay_method=0.05 sounds good. I > > haven't had any luck getting the 10 or so active services checking on my > > central Nagios server. > > > > Is anyone able to confirm that this is a known problem in Nagios, is there > > a better workaround, is this to be fixed in 2.0 final? > > > > Fred, keep the list posted if you make further breakthroughs. > > > > Cheers > > Alex > > > > On 7 Sep 2005 at 11:03, Fred wrote: > > > > > I think I have found the source of my issue with distributed monitoring > and > > > service checks. > > > > > > It turns out that if you enable distributed monitoring, even passive > > service > > > check definitions seem to get scheduled to run when nagios starts up. If > > > you have say 10350 services (give or take one) and use smart scheduling > of > > > services, you could easily see 3+ hours between the time that the first > > service > > > is scheduled and the last one. Changing the smart schduling to "n" for > > > no delay causes the services to not be scheduled in the future, but by > the > > > time nagios processes the entire configuration file, the start time is in > > > the past and I think nagios forgets about the service so it is never > > scheduled > > > again. > > > > > > I'm currently trying a service_inter_check_delay_method=0.05 which puts > me > > > at about 3 minutes for 10,000+ services, which seems to be enough time > for > > > nagios to startup and still have its first pending service scheduled in > the > > > near future rather then the near past ... > > > > > > Does this make sense to anyone who has been messing with these > > configuration > > > settings? > > > > > > Is there a better way to do this? I.e., I would like for nagios to *not* > > > consider the passive checks in any scheduling. I actually only have a > > small > > > number of active checks which when run will populate the rest of the > > passive > > > checks for the entire cluster, the problem is that it seems the node that > I > > > run these checks on is alphabetically *after* all of the other nodes so > it > > > seems to be scheduled last and has services starting the furthest out. > > > > > > Thanks. > > > -FredC > > > > > > > > > > > > > > > > > > > > > ------------------------------------------------------- > > > SF.Net email is Sponsored by the Better Software Conference & EXPO > > > September 19-22, 2005 * San Francisco, CA * Development Lifecycle > Practices > > > Agile & Plan-Driven Development * Managing Projects & Teams * Testing & > QA > > > Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf > > > _______________________________________________ > > > Nagios-users mailing list > > > Nagios-users at lists.sourceforge.net > > > https://lists.sourceforge.net/lists/listinfo/nagios-users > > > ::: Please include Nagios version, plugin version (-v) and OS when > > reporting any issue. > > > ::: Messages without supporting info will risk being sent to /dev/null > > > > > > > > > > > > > > > > > > ------------------------------------------------------- > > SF.Net email is Sponsored by the Better Software Conference & EXPO > > September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices > > Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA > > Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf > > _______________________________________________ > > Nagios-users mailing list > > Nagios-users at lists.sourceforge.net > > https://lists.sourceforge.net/lists/listinfo/nagios-users > > ::: Please include Nagios version, plugin version (-v) and OS when > reporting > > any issue. > > ::: Messages without supporting info will risk being sent to /dev/null > > > > > > > > > > ------------------------------------------------------- > SF.Net email is Sponsored by the Better Software Conference & EXPO > September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices > Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA > Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting > any issue. > ::: Messages without supporting info will risk being sent to /dev/null > ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jhmartin at toger.us Mon Sep 12 15:56:46 2005 From: jhmartin at toger.us (Jason Martin) Date: Mon, 12 Sep 2005 06:56:46 -0700 Subject: Scheduling (ordering) of Service checks In-Reply-To: <79BF0B53C4A28446A097D6170CF4C69955ED0C@xw2k3-hammbx-03.ads.dlh.de> References: <79BF0B53C4A28446A097D6170CF4C69955ED0C@xw2k3-hammbx-03.ads.dlh.de> Message-ID: <20050912135646.GA27427@zippy.toger.us> On Mon, Sep 12, 2005 at 01:42:09PM +0200, oliver.steenbuck at lhsystems.com wrote: > Is it possible to order the execution of different serivce checks in a certain way. (Not asfar as I can see but I may ofcourse be mistaken) Nagios does not support ordering of service checks. I'd suggest writing a script that steps through those steps and returns a meaningful error code if any of them fail. -Jason Martin -- Aren't cats just widdle furry balls of love? This message is PGP/MIME signed. -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: application/pgp-signature Size: 211 bytes Desc: not available URL: From rossz at vamos-wentworth.org Mon Sep 12 17:36:22 2005 From: rossz at vamos-wentworth.org (Rossz Vamos-Wentworth) Date: Mon, 12 Sep 2005 08:36:22 -0700 Subject: parent/child and alerts Message-ID: <4325A076.2050108@vamos-wentworth.org> I need to improve how alerts are handled. This morning the internet connection on my nagios server went down. This resulted in none of the remote tests working. I had thought nagios automatically checked if the host was reachable if there was a test failure, but from alerts I'm receiving, this turns out to not be true (or I screwed up something in the configuration. What must I do to receive only a single alert for each host when the internet connection is down instead of an alert for each service? A bonus would be to supress all the alerts if the nagios server loses its connection. I could configure one of the remote servers to test my primary nagios server's connection and send out an alert if it's unreachable. -- Rossz ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ae at op5.se Mon Sep 12 17:48:41 2005 From: ae at op5.se (Andreas Ericsson) Date: Mon, 12 Sep 2005 17:48:41 +0200 Subject: parent/child and alerts In-Reply-To: <4325A076.2050108@vamos-wentworth.org> References: <4325A076.2050108@vamos-wentworth.org> Message-ID: <4325A359.3040002@op5.se> Rossz Vamos-Wentworth wrote: > I need to improve how alerts are handled. This morning the internet > connection on my nagios server went down. This resulted in none of the > remote tests working. I had thought nagios automatically checked if the > host was reachable if there was a test failure, but from alerts I'm > receiving, this turns out to not be true (or I screwed up something in > the configuration. > You screwed up your configuration. > What must I do to receive only a single alert for each host when the > internet connection is down instead of an alert for each service? Un-screw your configuration. If you use the parents directive (in an un-screwed up way) you should only get very few alerts in a situation like this. > A > bonus would be to supress all the alerts if the nagios server loses its > connection. I could configure one of the remote servers to test my > primary nagios server's connection and send out an alert if it's > unreachable. > Just re-read the manual regarding object configuration. It'll dawn on you in a couple of tries. -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Lead Developer ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From rossz at vamos-wentworth.org Mon Sep 12 17:52:27 2005 From: rossz at vamos-wentworth.org (Rossz Vamos-Wentworth) Date: Mon, 12 Sep 2005 08:52:27 -0700 Subject: parent/child and alerts In-Reply-To: <4325A359.3040002@op5.se> References: <4325A076.2050108@vamos-wentworth.org> <4325A359.3040002@op5.se> Message-ID: <4325A43B.3030402@vamos-wentworth.org> Andreas Ericsson wrote: > Rossz Vamos-Wentworth wrote: > >> I need to improve how alerts are handled. This morning the internet >> connection on my nagios server went down. This resulted in none of >> the remote tests working. I had thought nagios automatically checked >> if the host was reachable if there was a test failure, but from alerts >> I'm receiving, this turns out to not be true (or I screwed up >> something in the configuration. >> > > You screwed up your configuration. I think I see the problem. Dumb of me. I'm testing the router that connects my nagios server to the internet. Pinging it works just fine even when there is no internet connection. What do other people use to test their connectivity? -- Rossz ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From benny at bennyvision.com Mon Sep 12 17:57:46 2005 From: benny at bennyvision.com (C. Bensend) Date: Mon, 12 Sep 2005 10:57:46 -0500 (CDT) Subject: parent/child and alerts In-Reply-To: <4325A43B.3030402@vamos-wentworth.org> References: <4325A076.2050108@vamos-wentworth.org> <4325A359.3040002@op5.se> <4325A43B.3030402@vamos-wentworth.org> Message-ID: <2917.134.244.169.17.1126540666.squirrel@webmail.stinkweasel.net> > What do other people use to test their connectivity? I ping the far end (the provider's end) serial interface of each of our DS1's. And just for good measure, in the host alias, I detail the circuit ID and support line number for each of them. Then, when the network folks lose a circuit, they have all the information they need to call in a trouble ticket with their providers, right there on their pager. Benny -- "Now, that next spring you find in your garage a creature that looks like a cross-bred badger and anaconda. A badgerconda." -- bash.org ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ae at op5.se Mon Sep 12 17:56:55 2005 From: ae at op5.se (Andreas Ericsson) Date: Mon, 12 Sep 2005 17:56:55 +0200 Subject: parent/child and alerts In-Reply-To: <4325A43B.3030402@vamos-wentworth.org> References: <4325A076.2050108@vamos-wentworth.org> <4325A359.3040002@op5.se> <4325A43B.3030402@vamos-wentworth.org> Message-ID: <4325A547.4090201@op5.se> Rossz Vamos-Wentworth wrote: > Andreas Ericsson wrote: > >> Rossz Vamos-Wentworth wrote: >> >>> I need to improve how alerts are handled. This morning the internet >>> connection on my nagios server went down. This resulted in none of >>> the remote tests working. I had thought nagios automatically checked >>> if the host was reachable if there was a test failure, but from >>> alerts I'm receiving, this turns out to not be true (or I screwed up >>> something in the configuration. >>> >> >> You screwed up your configuration. > > > I think I see the problem. Dumb of me. I'm testing the router that > connects my nagios server to the internet. Pinging it works just fine > even when there is no internet connection. > > What do other people use to test their connectivity? > A google ping (or some such), or the default route for your router. -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Lead Developer ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From srunschke at abit.de Mon Sep 12 18:14:46 2005 From: srunschke at abit.de (srunschke at abit.de) Date: Mon, 12 Sep 2005 18:14:46 +0200 Subject: Antwort: Re: parent/child and alerts In-Reply-To: <4325A43B.3030402@vamos-wentworth.org> References: <4325A43B.3030402@vamos-wentworth.org> Message-ID: nagios-users-admin at lists.sourceforge.net schrieb am 12.09.2005 17:52:27: > I think I see the problem. Dumb of me. I'm testing the router that > connects my nagios server to the internet. Pinging it works just fine > even when there is no internet connection. Of course, you are pinging an interface local to the router. > What do other people use to test their connectivity? Make use of the parent directive, it notices when you have a complete network outage. Add every router on the way to nagios and monitor it, if one of the routers shuts down nagios will detect a network outage and do not generate alerts for the machines behind - if you did not add "unreachable" as alert option (which is usually a bad idea anyways). Example of monitoring remote-machine-X: [nagios] -> [managed-switch] -> [core-router] -> [firewall-inner-eth-ring] -> [firewall-outer-eth-ring] -> [border-router] -> [remote-site-border-router] -> [remote-site-switch] -> [remote-machine-X] This is an example chain from our nagios to a machine at a remote site. Set the parent directives in inverse order from last to first and you will have perfect network outage detection. Do not forget to remove "unreachable" from the notification options for each machine though, it's screws things up if you work with parents. regards sash -------------------------------------------------- Sascha Runschke Netzwerk Administration IT-Services ABIT AG Robert-Bosch-Str. 1 40668 Meerbusch Tel.:+49 (0) 2150.9153.226 Mobil:+49 (0) 173.5419665 mailto:SRunschke at abit.de http://www.abit.net http://www.abit-epos.net --------------------------------- Sicherheitshinweis zur E-Mail Kommunikation / Security note regarding email communication: http://www.abit.net/sicherheitshinweis.html ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From rossz at vamos-wentworth.org Mon Sep 12 18:14:23 2005 From: rossz at vamos-wentworth.org (Rossz Vamos-Wentworth) Date: Mon, 12 Sep 2005 09:14:23 -0700 Subject: parent/child and alerts In-Reply-To: <4325A547.4090201@op5.se> References: <4325A076.2050108@vamos-wentworth.org> <4325A359.3040002@op5.se> <4325A43B.3030402@vamos-wentworth.org> <4325A547.4090201@op5.se> Message-ID: <4325A95F.10000@vamos-wentworth.org> Andreas Ericsson wrote: > Rossz Vamos-Wentworth wrote: > >> Andreas Ericsson wrote: >> >>> Rossz Vamos-Wentworth wrote: >>> >>>> I need to improve how alerts are handled. This morning the internet >>>> connection on my nagios server went down. This resulted in none of >>>> the remote tests working. I had thought nagios automatically >>>> checked if the host was reachable if there was a test failure, but >>>> from alerts I'm receiving, this turns out to not be true (or I >>>> screwed up something in the configuration. >>>> >>> >>> You screwed up your configuration. >> >> >> >> I think I see the problem. Dumb of me. I'm testing the router that >> connects my nagios server to the internet. Pinging it works just fine >> even when there is no internet connection. >> >> What do other people use to test their connectivity? >> > > A google ping (or some such), or the default route for your router. > I've defined this check command: define command{ command_name check_connectivity command_line $USER1$/check_ping -H www.google.com -w 100.0,20% -c 500.0,60% } And changed my router ping test to this: define service{ use local-service host_name router service_description CONNECTIVITY is_volatile 0 check_period 24x7 max_check_attempts 3 normal_check_interval 15 retry_check_interval 1 contact_groups linux-admins notification_interval 240 notification_period 24x7 notification_options c,r check_command check_connectivity } And made all remote hosts children of "router". Notifications for this service is disabled since it wouldn't work without any connectivity. I'll configure one of my remote servers to PING test my primary nagios server and send an alert if it fails. But I think there might still be a problem. I think I need to configure the router host to be entirely dependent upon this check. I'm reading up on host/service dependencies now. -- Rossz ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From rossz at vamos-wentworth.org Mon Sep 12 18:15:27 2005 From: rossz at vamos-wentworth.org (Rossz Vamos-Wentworth) Date: Mon, 12 Sep 2005 09:15:27 -0700 Subject: parent/child and alerts In-Reply-To: <2917.134.244.169.17.1126540666.squirrel@webmail.stinkweasel.net> References: <4325A076.2050108@vamos-wentworth.org> <4325A359.3040002@op5.se> <4325A43B.3030402@vamos-wentworth.org> <2917.134.244.169.17.1126540666.squirrel@webmail.stinkweasel.net> Message-ID: <4325A99F.10603@vamos-wentworth.org> C. Bensend wrote: >>What do other people use to test their connectivity? > > > I ping the far end (the provider's end) serial interface of each of > our DS1's. And just for good measure, in the host alias, I detail > the circuit ID and support line number for each of them. I have no idea how to do that. :( -- Rossz ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Mon Sep 12 18:21:14 2005 From: marc at ena.com (Marc Powell) Date: Mon, 12 Sep 2005 11:21:14 -0500 Subject: parent/child and alerts Message-ID: > -----Original Message----- > From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users- > admin at lists.sourceforge.net] On Behalf Of Rossz Vamos-Wentworth > Sent: Monday, September 12, 2005 11:14 AM > To: nagios-users at lists.sourceforge.net > Subject: Re: [Nagios-users] parent/child and alerts > > Andreas Ericsson wrote: > > Rossz Vamos-Wentworth wrote: > > > >> Andreas Ericsson wrote: > >> > >>> Rossz Vamos-Wentworth wrote: > >>> > >>>> I need to improve how alerts are handled. This morning the internet > >>>> connection on my nagios server went down. This resulted in none of > >>>> the remote tests working. I had thought nagios automatically > >>>> checked if the host was reachable if there was a test failure, but > >>>> from alerts I'm receiving, this turns out to not be true (or I > >>>> screwed up something in the configuration. > >>>> > >>> > >>> You screwed up your configuration. > >> > >> > >> > >> I think I see the problem. Dumb of me. I'm testing the router that > >> connects my nagios server to the internet. Pinging it works just fine > >> even when there is no internet connection. > >> > >> What do other people use to test their connectivity? > >> > > > > A google ping (or some such), or the default route for your router. > > > > I've defined this check command: > > define command{ > command_name check_connectivity > command_line $USER1$/check_ping -H www.google.com -w 100.0,20% -c > 500.0,60% > } Instead of pinging Google, and I'm sure they have enough people doing it to amount to a sizable chunk of bandwidth, why not be a little more Internet friendly and ping the far side of your router connection to your ISP? -- Marc ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jsmforum at optonline.net Mon Sep 12 19:13:26 2005 From: jsmforum at optonline.net (Jeff) Date: Mon, 12 Sep 2005 13:13:26 -0400 Subject: Plugin errors Message-ID: Hey all, Had to rebuild a server over the weekend and upgraded it from RH EL2 to RH EL3. Now I'm trying to get nagios 1.2 setup again and I'm getting this error on a couple of plugins.... [root at mis02tc07927 libexec]# ./check_http ./check_http: error while loading shared libraries: libssl.so.2: cannot open shared object file: No such file or directory Anyone seen this before and have a simple solution? Thanks, Jeff ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From robmossrm at aol.com Mon Sep 12 19:17:24 2005 From: robmossrm at aol.com (Rob Moss) Date: Mon, 12 Sep 2005 18:17:24 +0100 Subject: Plugin errors In-Reply-To: References: Message-ID: <4325B824.4050800@aol.com> Jeff wrote: >Hey all, > >Had to rebuild a server over the weekend and upgraded it from RH EL2 to >RH EL3. Now I'm trying to get nagios 1.2 setup again and I'm getting >this error on a couple of plugins.... > >[root at mis02tc07927 libexec]# ./check_http >./check_http: error while loading shared libraries: libssl.so.2: cannot >open shared object file: No such file or directory > >Anyone seen this before and have a simple solution? > >Thanks, > >Jeff > > Install the openssl package..... If it's already installed, then point your library paths to where the libssl.so.2 library lives, update the /etc/ld.so.conf file and run ldconfig. rob. ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jmckeon at telaurus.com Mon Sep 12 19:32:42 2005 From: jmckeon at telaurus.com (Jeff McKeon) Date: Mon, 12 Sep 2005 13:32:42 -0400 Subject: Plugin errors Message-ID: > -----Original Message----- > From: nagios-users-admin at lists.sourceforge.net > [mailto:nagios-users-admin at lists.sourceforge.net] On Behalf > Of Rob Moss > Sent: Monday, September 12, 2005 13:17 > To: Jeff > Cc: nagios-users at lists.sourceforge.net > Subject: Re: [Nagios-users] Plugin errors > > > Jeff wrote: > > >Hey all, > > > >Had to rebuild a server over the weekend and upgraded it > from RH EL2 to > >RH EL3. Now I'm trying to get nagios 1.2 setup again and > I'm getting > >this error on a couple of plugins.... > > > >[root at mis02tc07927 libexec]# ./check_http > >./check_http: error while loading shared libraries: > libssl.so.2: cannot > >open shared object file: No such file or directory > > > >Anyone seen this before and have a simple solution? > > > >Thanks, > > > >Jeff > > > > > > Install the openssl package..... > > If it's already installed, then point your library paths to where the > libssl.so.2 library lives, update the /etc/ld.so.conf file > and run ldconfig. > > rob. Rpm -qa | grep openssl returns: openssl-0.9.7a-33.15 When I do a locate libssl I get the following: [snip] [root at mis02tc07927 libexec]# locate libssl /usr/lib/libssl3.so /usr/lib/libssl.a /usr/lib/libssl.so /lib/libssl.so.4 /lib/libssl.so.0.9.7a [/snip] No libssl.so.2..... /etc/ld.so.conf looks like this... /usr/kerberos/lib /usr/X11R6/lib /usr/lib/qt-3.1/lib Do I not have the correct libssl.so installed? Thanks, Jeff ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From robmossrm at aol.com Mon Sep 12 19:42:47 2005 From: robmossrm at aol.com (Rob Moss) Date: Mon, 12 Sep 2005 18:42:47 +0100 Subject: Plugin errors In-Reply-To: References: Message-ID: <4325BE17.6080101@aol.com> Jeff McKeon wrote: >>-----Original Message----- >>From: nagios-users-admin at lists.sourceforge.net >>[mailto:nagios-users-admin at lists.sourceforge.net] On Behalf >>Of Rob Moss >>Sent: Monday, September 12, 2005 13:17 >>To: Jeff >>Cc: nagios-users at lists.sourceforge.net >>Subject: Re: [Nagios-users] Plugin errors >> >> >>Jeff wrote: >> >> >> >>>Hey all, >>> >>>Had to rebuild a server over the weekend and upgraded it >>> >>> >>from RH EL2 to >> >> >>>RH EL3. Now I'm trying to get nagios 1.2 setup again and >>> >>> >>I'm getting >> >> >>>this error on a couple of plugins.... >>> >>>[root at mis02tc07927 libexec]# ./check_http >>>./check_http: error while loading shared libraries: >>> >>> >>libssl.so.2: cannot >> >> >>>open shared object file: No such file or directory >>> >>>Anyone seen this before and have a simple solution? >>> >>>Thanks, >>> >>>Jeff >>> >>> >>> >>> >>Install the openssl package..... >> >>If it's already installed, then point your library paths to where the >>libssl.so.2 library lives, update the /etc/ld.so.conf file >>and run ldconfig. >> >>rob. >> >> > >Rpm -qa | grep openssl returns: > >openssl-0.9.7a-33.15 > >When I do a locate libssl I get the following: > >[snip] > >[root at mis02tc07927 libexec]# locate libssl >/usr/lib/libssl3.so >/usr/lib/libssl.a >/usr/lib/libssl.so >/lib/libssl.so.4 >/lib/libssl.so.0.9.7a > >[/snip] > >No libssl.so.2..... > >/etc/ld.so.conf looks like this... > >/usr/kerberos/lib >/usr/X11R6/lib >/usr/lib/qt-3.1/lib > >Do I not have the correct libssl.so installed? > >Thanks, > >Jeff > I would say that its from upgrading to a newer version of RedHat.. Jump on rpmfind.net or your favourite source of RPM's and try to find the version of openssl that you need.. rob. -------------- next part -------------- An HTML attachment was scrubbed... URL: From werner.flamme at ufz.de Mon Sep 12 19:53:40 2005 From: werner.flamme at ufz.de (Werner Flamme) Date: Mon, 12 Sep 2005 19:53:40 +0200 Subject: Plugin errors In-Reply-To: References: Message-ID: <4325C0A4.8070009@ufz.de> Jeff McKeon schrieb am 12.09.2005 19:32: >>-----Original Message----- >>From: nagios-users-admin at lists.sourceforge.net >>[mailto:nagios-users-admin at lists.sourceforge.net] On Behalf >>Of Rob Moss >>Sent: Monday, September 12, 2005 13:17 >>To: Jeff >>Cc: nagios-users at lists.sourceforge.net >>Subject: Re: [Nagios-users] Plugin errors >> >> >>Jeff wrote: >> >> >>>Hey all, >>> >>>Had to rebuild a server over the weekend and upgraded it >> >>from RH EL2 to >> >>>RH EL3. Now I'm trying to get nagios 1.2 setup again and >> >>I'm getting >> >>>this error on a couple of plugins.... >>> >>>[root at mis02tc07927 libexec]# ./check_http >>>./check_http: error while loading shared libraries: >> >>libssl.so.2: cannot >> >>>open shared object file: No such file or directory >>> >>>Anyone seen this before and have a simple solution? >>> >>>Thanks, >>> >>>Jeff >>> >>> >> >>Install the openssl package..... >> >>If it's already installed, then point your library paths to where the >>libssl.so.2 library lives, update the /etc/ld.so.conf file >>and run ldconfig. >> >>rob. > > > Rpm -qa | grep openssl returns: > > openssl-0.9.7a-33.15 > > When I do a locate libssl I get the following: > > [snip] > > [root at mis02tc07927 libexec]# locate libssl > /usr/lib/libssl3.so > /usr/lib/libssl.a > /usr/lib/libssl.so > /lib/libssl.so.4 > /lib/libssl.so.0.9.7a > > [/snip] > > No libssl.so.2..... > > /etc/ld.so.conf looks like this... > > /usr/kerberos/lib > /usr/X11R6/lib > /usr/lib/qt-3.1/lib > > Do I not have the correct libssl.so installed? > > Thanks, > > Jeff Yes and no ;-) Yes - they may be correct for RHEL3. No - the are - obviously - not correct for your nagios-plugins. Maybe you have to recompile those plugins? They should look for the newer libraries then... ;-) HTH, Werner -- Werner Flamme, Abt. WKDV UFZ Umweltforschungszentrum Leipzig-Halle GmbH, Permoserstr. 15, 04318 Leipzig - http://www.ufz.de eMail: werner.flamme at ufz.de, Tel.: (0341) 235-3921 ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jsmforum at optonline.net Mon Sep 12 19:58:03 2005 From: jsmforum at optonline.net (Jeff) Date: Mon, 12 Sep 2005 13:58:03 -0400 Subject: Plugin errors (SOLVED) In-Reply-To: References: Message-ID: Installed the latest plugins and it all works. Thanks! Jeff > -----Original Message----- > From: nagios-users-admin at lists.sourceforge.net > [mailto:nagios-users-admin at lists.sourceforge.net] On Behalf > Of Werner Flamme > Sent: Monday, September 12, 2005 13:54 > To: nagios-users at lists.sourceforge.net > Subject: Re: [Nagios-users] Plugin errors > > > Jeff McKeon schrieb am 12.09.2005 19:32: > >>-----Original Message----- > >>From: nagios-users-admin at lists.sourceforge.net > >>[mailto:nagios-users-admin at lists.sourceforge.net] On Behalf > >>Of Rob Moss > >>Sent: Monday, September 12, 2005 13:17 > >>To: Jeff > >>Cc: nagios-users at lists.sourceforge.net > >>Subject: Re: [Nagios-users] Plugin errors > >> > >> > >>Jeff wrote: > >> > >> > >>>Hey all, > >>> > >>>Had to rebuild a server over the weekend and upgraded it > >> > >>from RH EL2 to > >> > >>>RH EL3. Now I'm trying to get nagios 1.2 setup again and > >> > >>I'm getting > >> > >>>this error on a couple of plugins.... > >>> > >>>[root at mis02tc07927 libexec]# ./check_http > >>>./check_http: error while loading shared libraries: > >> > >>libssl.so.2: cannot > >> > >>>open shared object file: No such file or directory > >>> > >>>Anyone seen this before and have a simple solution? > >>> > >>>Thanks, > >>> > >>>Jeff > >>> > >>> > >> > >>Install the openssl package..... > >> > >>If it's already installed, then point your library paths to > where the > >>libssl.so.2 library lives, update the /etc/ld.so.conf file > >>and run ldconfig. > >> > >>rob. > > > > > > Rpm -qa | grep openssl returns: > > > > openssl-0.9.7a-33.15 > > > > When I do a locate libssl I get the following: > > > > [snip] > > > > [root at mis02tc07927 libexec]# locate libssl /usr/lib/libssl3.so > > /usr/lib/libssl.a > > /usr/lib/libssl.so > > /lib/libssl.so.4 > > /lib/libssl.so.0.9.7a > > > > [/snip] > > > > No libssl.so.2..... > > > > /etc/ld.so.conf looks like this... > > > > /usr/kerberos/lib > > /usr/X11R6/lib > > /usr/lib/qt-3.1/lib > > > > Do I not have the correct libssl.so installed? > > > > Thanks, > > > > Jeff > > Yes and no ;-) > > Yes - they may be correct for RHEL3. No - the are - obviously > - not correct > for your nagios-plugins. > > Maybe you have to recompile those plugins? They should look > for the newer > libraries then... ;-) > > HTH, > Werner > > -- > Werner Flamme, Abt. WKDV > UFZ Umweltforschungszentrum Leipzig-Halle GmbH, > Permoserstr. 15, 04318 Leipzig - http://www.ufz.de > eMail: werner.flamme at ufz.de, Tel.: (0341) 235-3921 > > > > > > > ------------------------------------------------------- > SF.Net email is Sponsored by the Better Software Conference & > EXPO September 19-22, 2005 * San Francisco, CA * Development > Lifecycle Practices Agile & Plan-Driven Development * > Managing Projects & Teams * Testing & QA Security * Process > Improvement & Measurement * http://www.sqe.com/bsce5sf > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS > when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From benny at bennyvision.com Mon Sep 12 20:31:37 2005 From: benny at bennyvision.com (C. Bensend) Date: Mon, 12 Sep 2005 13:31:37 -0500 (CDT) Subject: parent/child and alerts In-Reply-To: <4325A99F.10603@vamos-wentworth.org> References: <4325A076.2050108@vamos-wentworth.org> <4325A359.3040002@op5.se> <4325A43B.3030402@vamos-wentworth.org> <2917.134.244.169.17.1126540666.squirrel@webmail.stinkweasel.net> <4325A99F.10603@vamos-wentworth.org> Message-ID: <3391.134.244.169.17.1126549897.squirrel@webmail.stinkweasel.net> >> I ping the far end (the provider's end) serial interface of each of >> our DS1's. And just for good measure, in the host alias, I detail >> the circuit ID and support line number for each of them. > > I have no idea how to do that. :( Just find out the IP address(es) for the remote end of your pipe(s) to your network provider, and create new host(s) for it. Then, use your router as a parent for the host above. If your pipe goes down, you get alerted about it, and only it. ISP | * ISP's end of your pipe (a.b.c.d) | | Your pipe to your ISP | * Your router (e.f.g.h) | Your network So, you could define two hosts, one for your router, and one for your ISP's router: define host{ use generic-host host_name upstream-t1 parents my-router alias upstream-t1 - Our internet connection address a.b.c.d check_command check-host-alive max_check_attempts 10 notification_interval 120 notification_period 24x7 notification_options d,u,r contact_groups network-admins } define host{ use generic-host host_name my-router alias my-router - Our internet connection router address e.f.g.h check_command check-host-alive max_check_attempts 10 notification_interval 120 notification_period 24x7 notification_options d,u,r contact_groups network-admins } -- "Now, that next spring you find in your garage a creature that looks like a cross-bred badger and anaconda. A badgerconda." -- bash.org ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mailings at good-it.com Mon Sep 12 21:11:15 2005 From: mailings at good-it.com (Johan Barelds) Date: Mon, 12 Sep 2005 21:11:15 +0200 Subject: Disable notifications for hostgroups Message-ID: <200509122111.15748.mailings@good-it.com> Hi all, Is there a way to disable notifications for a complete hostgroup via the gui? We do have a lot of servers in some hostgroups and when we do some major patching we wan't to disable the notifications for a hostgroups in one click. The guys who do the patching don't know any linux so disabling notifications in nagios.cmd is not an option. Thanks for any suggestions. -- Kind Regards / Met vriendelijke groet, Johan Barelds ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From kidd1270 at gmail.com Mon Sep 12 21:18:19 2005 From: kidd1270 at gmail.com (Kidd Chaos) Date: Mon, 12 Sep 2005 14:18:19 -0500 Subject: scheduled_downtime_depth Message-ID: <91a3540905091212183de1c118@mail.gmail.com> ------------------------------------------------------------------------------------------------------------------- Nagios 1.2 Question: Anyone know what scheduled_downtime_depth used for? (Its in the status log). -Thanks, Kidd ------------------------------------------------------------------------------------------------------------------- -------------- next part -------------- An HTML attachment was scrubbed... URL: From marc at ena.com Mon Sep 12 21:39:04 2005 From: marc at ena.com (Marc Powell) Date: Mon, 12 Sep 2005 14:39:04 -0500 Subject: Disable notifications for hostgroups Message-ID: > -----Original Message----- > From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users- > admin at lists.sourceforge.net] On Behalf Of Johan Barelds > Sent: Monday, September 12, 2005 2:11 PM > To: nagios-users at lists.sourceforge.net > Subject: [Nagios-users] Disable notifications for hostgroups > > Hi all, > > Is there a way to disable notifications for a complete hostgroup via the > gui? When viewing the host group or Status Summary, click on the hostgroup name (the part in parenthesis). -- Marc ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mailings at good-it.com Mon Sep 12 22:25:12 2005 From: mailings at good-it.com (Johan Barelds) Date: Mon, 12 Sep 2005 22:25:12 +0200 Subject: Disable notifications for hostgroups In-Reply-To: References: Message-ID: <200509122225.13068.mailings@good-it.com> Op maandag 12 september 2005 21:39, schreef Marc Powell: > When viewing the host group or Status Summary, click on the hostgroup > name (the part in parenthesis). Thanks Marc! That's what i was looking for. Great stuff! -- Kind Regards / Met vriendelijke groet, Johan Barelds ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Juliet_Tree at cotyinc.com Tue Sep 13 02:00:50 2005 From: Juliet_Tree at cotyinc.com (Juliet_Tree at cotyinc.com) Date: Tue, 13 Sep 2005 01:00:50 +0100 Subject: Juliet Tree/ASHFORD/UK/COTY is out of the office. Message-ID: I will be out of the office starting 12/09/2005 and will not return until 19/09/2005. I am away so please contact Paul Seal on 2261 for any urgent issues. Thankyou Thanks. ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From niceforums at yahoo.com Tue Sep 13 08:36:31 2005 From: niceforums at yahoo.com (hamideh daliri) Date: Mon, 12 Sep 2005 23:36:31 -0700 (PDT) Subject: nagios web interface work with enabled SELinux Message-ID: <20050913063632.53897.qmail@web30106.mail.mud.yahoo.com> if the SELinux is active and the enforcing mod is set the instructions below will solve the internal error of apache ... i have defined a new type for nagios ,named nagios_t , it should be defined in /etc/selinux/targeted/src/policy/types/file.te what should be written in this file is : type nagios_t,file_type,root_dir_type,sysadmfile; then add the lines below to /etc/selinux/targeted/src/policy/domains/program/apache.te : allow httpd_t nagios_t : dir { getattr search }; allow httpd_t nagios_t:file{getattr read execute\ execute_no_trans }; then you have to change the security label of nagios direstory and its contents : go to where the nagios is installed , on my box is /usr/local/ and run this command : chcon ?Rf -u root -r object_r -t nagios_t nagios/ then change the path to var/ subdirectory in nagios dir and run these commands : chcon ?Rf -u user_u -r object_r -t nagios_t status.sav chcon ?Rf -u user_u -r object_r -t nagios_t nagios.log now go to /etc/selinux/targeted/src/policy and run ' make reload ' or 'make load ' to compile the new policy and load it to load it to memory . it is ok on my box , hope it helps you too . my knowlege about SELinux isn't too much , so if you think there is any problem with what i did or it will cause any problem in future let me know, tnx . __________________________________________________ Do You Yahoo!? Tired of spam? Yahoo! Mail has the best spam protection around http://mail.yahoo.com ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From m.borsani at it.net Tue Sep 13 14:42:08 2005 From: m.borsani at it.net (Marco Borsani) Date: Tue, 13 Sep 2005 14:42:08 +0200 Subject: plugin 1.4 - check_ping Message-ID: Hi all ! I am testing plugin 1.4 on my HP-UX 11.0 and Nagios 2 system , but when I run check_ping command (release 1.45) a receive a "Memory fault(coredump)" #> ./check_ping -H IPADDRESS -w 100,10% -c 200,20% Same command using check_ping release 1.11 works fine. Do you know why? Regards Marco Borsani Unix & Monitoring System Administrator Technical Operation Tel. +39 010 4310115 Fax +39 010 4327454 E-mail: m.borsani at IT.net ITnet S.r.l. - Direzione e Coordinamento di WIND Telecomunicazioni S.p.A. Internet Service Provider Sede legale: Via C.G.Viola, 48 - 00148 Roma Dir. Centrale e Amministrativa: Via Pacinotti, 39 16151 Genova (Italy) http://www.it.net mailto:info at IT.net _______________________________________________________________ Altre sedi ITnet: MILANO tel.: +39 02 30114900 info-milano at IT.net ROMA tel.: +39 06 83116707 info-roma at IT.net _______________________________________________________________ ITnet is associated to CIX (Commercial IP eXchange) and RIPE ITnet is associated to AIIP (Associazione Italiana Internet Providers) ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From izotov at list.ru Tue Sep 13 15:25:04 2005 From: izotov at list.ru (Izotov Igor) Date: Tue, 13 Sep 2005 17:25:04 +0400 Subject: Question regarding geographically distributed monitoring Message-ID: Hello, everyone! I need to create the following monitoring process: A,B,C are monitoring servers, distributed all over the world, seeing each other. D is the monitored host. A is the "main" host, which sends notification. Notifications should be sent only in case when D is not responding to all of them (A && B && C). Can it be done by means of nagios? Thank you. ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From robmossrm at aol.com Tue Sep 13 15:03:46 2005 From: robmossrm at aol.com (Rob Moss) Date: Tue, 13 Sep 2005 14:03:46 +0100 Subject: plugin 1.4 - check_ping In-Reply-To: References: Message-ID: <4326CE32.1020005@aol.com> Marco Borsani wrote: >Hi all ! > >I am testing plugin 1.4 on my HP-UX 11.0 and Nagios 2 system , but when I >run check_ping command (release 1.45) a receive a "Memory fault(coredump)" >#> ./check_ping -H IPADDRESS -w 100,10% -c 200,20% > >Same command using check_ping release 1.11 works fine. > >Do you know why? > > This could be caused by any number of reasons.. 1. Did you compile nagios / nagios-plugins on this server (as opposed to compiling on some other server and copying the binaries over) 2. Run 'ldd /path/to/check_ping' and send back the output 3. Check if check_ping and/or /usr/bin/ping is setuid 4. Can you run /usr/bin/ping as the nagios user You could alternatively try the check_icmp program which runs the icmp ping itself, as opposed to the check_ping program which is a wrapper around /usr/bin/ping. check_icmp also takes exactly the same command arguments. Just remember that check_icmp needs to be setuid: chmod'ed to 4555 and ownership as root Cheers rob. ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ae at op5.se Tue Sep 13 15:35:18 2005 From: ae at op5.se (Andreas Ericsson) Date: Tue, 13 Sep 2005 15:35:18 +0200 Subject: Question regarding geographically distributed monitoring In-Reply-To: References: Message-ID: <4326D596.3090800@op5.se> Izotov Igor wrote: > Hello, everyone! > I need to create the following monitoring process: > A,B,C are monitoring servers, distributed all over the world, seeing each > other. > D is the monitored host. > A is the "main" host, which sends notification. > Notifications should be sent only in case when D is not responding to all of > them (A && B && C). > Can it be done by means of nagios? > I think so, provided they all send passive check-results to each other. -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Lead Developer ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ae at op5.se Tue Sep 13 14:56:18 2005 From: ae at op5.se (Andreas Ericsson) Date: Tue, 13 Sep 2005 14:56:18 +0200 Subject: plugin 1.4 - check_ping In-Reply-To: References: Message-ID: <4326CC72.5050201@op5.se> Marco Borsani wrote: > Hi all ! > > I am testing plugin 1.4 on my HP-UX 11.0 and Nagios 2 system , but when I > run check_ping command (release 1.45) a receive a "Memory fault(coredump)" > #> ./check_ping -H IPADDRESS -w 100,10% -c 200,20% > > Same command using check_ping release 1.11 works fine. > > Do you know why? > check_ping is sort of obsolete and not very well written. Try check_icmp instead. The latest version is at http://oss.op5.se/nagios and is known to compile cleanly under HP-UX. The plugin distro there also contains a check_ping which might work for you, although no guarantees are made. -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Lead Developer ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From izotov at list.ru Tue Sep 13 15:40:08 2005 From: izotov at list.ru (Izotov Igor) Date: Tue, 13 Sep 2005 17:40:08 +0400 Subject: Question regarding geographically distributed monitoring In-Reply-To: <4326D596.3090800@op5.se> References: <4326D596.3090800@op5.se> Message-ID: Yes, passive check results are ok, but I can't understand how to combine them. -----Original Message----- From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users-admin at lists.sourceforge.net] On Behalf Of Andreas Ericsson Sent: Tuesday, September 13, 2005 5:35 PM To: nagios-users at lists.sourceforge.net Subject: Re: [Nagios-users] Question regarding geographically distributed monitoring Izotov Igor wrote: > Hello, everyone! > I need to create the following monitoring process: > A,B,C are monitoring servers, distributed all over the world, seeing each > other. > D is the monitored host. > A is the "main" host, which sends notification. > Notifications should be sent only in case when D is not responding to all of > them (A && B && C). > Can it be done by means of nagios? > I think so, provided they all send passive check-results to each other. -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Lead Developer ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From m.borsani at it.net Tue Sep 13 15:49:43 2005 From: m.borsani at it.net (Marco Borsani) Date: Tue, 13 Sep 2005 15:49:43 +0200 Subject: R: plugin 1.4 - check_ping In-Reply-To: <4326CE32.1020005@aol.com> References: <4326CE32.1020005@aol.com> Message-ID: 1) Yes , I compiled on same server (same steps of 1.3 plugins) 2) ldd /usr/local/nagios/libexec/check_ping /usr/lib/libc.2 => /usr/lib/libc.2 /usr/lib/libdld.2 => /usr/lib/libdld.2 /usr/lib/libc.2 => /usr/lib/libc.2 3) ok 4) Yes, I can run /usr/sbin/ping as nagios user Other ideas? Actually I can not use check_icmp due some internal rules; check_icmp has 755 persissions (not 4555) , but it is running correctly. Regards Marco Borsani -}-----Messaggio originale----- -}Da: nagios-users-admin at lists.sourceforge.net -}[mailto:nagios-users-admin at lists.sourceforge.net]Per conto di Rob Moss -}Inviato: marted? 13 settembre 2005 15.04 -}Cc: NAGIOS -}Oggetto: Re: [Nagios-users] plugin 1.4 - check_ping -} -} -} -}Marco Borsani wrote: -} -}>Hi all ! -}> -}>I am testing plugin 1.4 on my HP-UX 11.0 and Nagios 2 system , but when I -}>run check_ping command (release 1.45) a receive a "Memory -}fault(coredump)" -}>#> ./check_ping -H IPADDRESS -w 100,10% -c 200,20% -}> -}>Same command using check_ping release 1.11 works fine. -}> -}>Do you know why? -}> -}> -} -}This could be caused by any number of reasons.. -} -}1. Did you compile nagios / nagios-plugins on this server (as opposed to -}compiling on some other server and copying the binaries over) -}2. Run 'ldd /path/to/check_ping' and send back the output -}3. Check if check_ping and/or /usr/bin/ping is setuid -}4. Can you run /usr/bin/ping as the nagios user -} -} -}You could alternatively try the check_icmp program which runs the icmp -}ping itself, as opposed to the check_ping program which is a wrapper -}around /usr/bin/ping. check_icmp also takes exactly the same command -}arguments. Just remember that check_icmp needs to be setuid: chmod'ed -}to 4555 and ownership as root -} -}Cheers -}rob. -} -} -} -}------------------------------------------------------- -}SF.Net email is Sponsored by the Better Software Conference & EXPO -}September 19-22, 2005 * San Francisco, CA * Development Lifecycle -}Practices -}Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA -}Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf -}_______________________________________________ -}Nagios-users mailing list -}Nagios-users at lists.sourceforge.net -}https://lists.sourceforge.net/lists/listinfo/nagios-users -}::: Please include Nagios version, plugin version (-v) and OS -}when reporting any issue. -}::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From oliver.steenbuck at lhsystems.com Tue Sep 13 15:56:12 2005 From: oliver.steenbuck at lhsystems.com (oliver.steenbuck at lhsystems.com) Date: Tue, 13 Sep 2005 15:56:12 +0200 Subject: AW: plugin 1.4 - check_ping Message-ID: <79BF0B53C4A28446A097D6170CF4C69955ED2F@xw2k3-hammbx-03.ads.dlh.de> What kind of internal rules do you mean ? Asfar as I can see check_icmp can not run when it is not setuid root as I would guess that it requires some low level access to networking stuff. -----Urspr?ngliche Nachricht----- Von: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users-admin at lists.sourceforge.net]Im Auftrag von Marco Borsani Gesendet am: Dienstag, 13. September 2005 15:50 An: Rob Moss Cc: NAGIOS Betreff: R: [Nagios-users] plugin 1.4 - check_ping 1) Yes , I compiled on same server (same steps of 1.3 plugins) 2) ldd /usr/local/nagios/libexec/check_ping /usr/lib/libc.2 => /usr/lib/libc.2 /usr/lib/libdld.2 => /usr/lib/libdld.2 /usr/lib/libc.2 => /usr/lib/libc.2 3) ok 4) Yes, I can run /usr/sbin/ping as nagios user Other ideas? Actually I can not use check_icmp due some internal rules; check_icmp has 755 persissions (not 4555) , but it is running correctly. Regards Marco Borsani -}-----Messaggio originale----- -}Da: nagios-users-admin at lists.sourceforge.net -}[mailto:nagios-users-admin at lists.sourceforge.net]Per conto di Rob Moss -}Inviato: marted? 13 settembre 2005 15.04 -}Cc: NAGIOS -}Oggetto: Re: [Nagios-users] plugin 1.4 - check_ping -} -} -} -}Marco Borsani wrote: -} -}>Hi all ! -}> -}>I am testing plugin 1.4 on my HP-UX 11.0 and Nagios 2 system , but when I -}>run check_ping command (release 1.45) a receive a "Memory -}fault(coredump)" -}>#> ./check_ping -H IPADDRESS -w 100,10% -c 200,20% -}> -}>Same command using check_ping release 1.11 works fine. -}> -}>Do you know why? -}> -}> -} -}This could be caused by any number of reasons.. -} -}1. Did you compile nagios / nagios-plugins on this server (as opposed to -}compiling on some other server and copying the binaries over) -}2. Run 'ldd /path/to/check_ping' and send back the output -}3. Check if check_ping and/or /usr/bin/ping is setuid -}4. Can you run /usr/bin/ping as the nagios user -} -} -}You could alternatively try the check_icmp program which runs the icmp -}ping itself, as opposed to the check_ping program which is a wrapper -}around /usr/bin/ping. check_icmp also takes exactly the same command -}arguments. Just remember that check_icmp needs to be setuid: chmod'ed -}to 4555 and ownership as root -} -}Cheers -}rob. -} -} -} -}------------------------------------------------------- -}SF.Net email is Sponsored by the Better Software Conference & EXPO -}September 19-22, 2005 * San Francisco, CA * Development Lifecycle -}Practices -}Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA -}Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf -}_______________________________________________ -}Nagios-users mailing list -}Nagios-users at lists.sourceforge.net -}https://lists.sourceforge.net/lists/listinfo/nagios-users -}::: Please include Nagios version, plugin version (-v) and OS -}when reporting any issue. -}::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -------------- next part -------------- An HTML attachment was scrubbed... URL: From ae at op5.se Tue Sep 13 16:00:28 2005 From: ae at op5.se (Andreas Ericsson) Date: Tue, 13 Sep 2005 16:00:28 +0200 Subject: R: plugin 1.4 - check_ping In-Reply-To: References: Message-ID: <4326DB7C.9080206@op5.se> Marco Borsani wrote: > 1) Yes , I compiled on same server (same steps of 1.3 plugins) > > 2) ldd /usr/local/nagios/libexec/check_ping > /usr/lib/libc.2 => /usr/lib/libc.2 > /usr/lib/libdld.2 => /usr/lib/libdld.2 > /usr/lib/libc.2 => /usr/lib/libc.2 > > 3) ok > > 4) Yes, I can run /usr/sbin/ping as nagios user > > Other ideas? > Downgrade the plugin-package. nagios-plugins-1.3.1 seems by far the most portable and stable version of all the plugins. You could also try the check_ping in http://oss.op5.se/nagios/op5plugins-2005-09-22.tar.gz > Actually I can not use check_icmp due some internal rules; > check_icmp has 755 persissions (not 4555) , but it is running correctly. > Strange. You could set it to 4110 if that makes anybody any happier. The nagios user can be disabled from logging in, and that'd be a safer setup than allowing the ping binary to keep on being setuid root. If you take a look at the check_icmp code, you'll also notice that I take great pains of making sure everything is calculated properly, and that it drops privileges immediately after obtaining the socket. That is far more defensive than most regular ping implementations. -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Lead Developer ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nagios at lamp.xs4all.nl Tue Sep 13 16:05:30 2005 From: nagios at lamp.xs4all.nl (Lennard bakker) Date: Tue, 13 Sep 2005 16:05:30 +0200 Subject: Question regarding geographically distributed monitoring In-Reply-To: References: Message-ID: <4326DCAA.4060906@lamp.xs4all.nl> On host A make 3 checks. D'a (active checks) D'b (passive checks, feed by host B) D'c (passive checks, feed by host C) For all 3 no notifications will be send. Now create an check_cluster D with 3 hosts (D'a, D'b and D'c). This cluster check will send a notification when 3 out of 3 will fail. Lennard Izotov Igor wrote: > Yes, passive check results are ok, but I can't understand how to combine > them. > > -----Original Message----- > From: nagios-users-admin at lists.sourceforge.net > [mailto:nagios-users-admin at lists.sourceforge.net] On Behalf Of Andreas > Ericsson > Sent: Tuesday, September 13, 2005 5:35 PM > To: nagios-users at lists.sourceforge.net > Subject: Re: [Nagios-users] Question regarding geographically distributed > monitoring > > Izotov Igor wrote: > >>Hello, everyone! >>I need to create the following monitoring process: >>A,B,C are monitoring servers, distributed all over the world, seeing each >>other. >>D is the monitored host. >>A is the "main" host, which sends notification. >>Notifications should be sent only in case when D is not responding to all > > of > >>them (A && B && C). >>Can it be done by means of nagios? >> > > > I think so, provided they all send passive check-results to each other. > ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From izotov at list.ru Tue Sep 13 16:15:31 2005 From: izotov at list.ru (Izotov Igor) Date: Tue, 13 Sep 2005 18:15:31 +0400 Subject: Question regarding geographically distributed monitoring In-Reply-To: <4326DCAA.4060906@lamp.xs4all.nl> References: <4326DCAA.4060906@lamp.xs4all.nl> Message-ID: Clumsy solution, but it will work. Thank you. And, perhaps anyone came across monitoring solutions that allow to do it in a direct way to solve the problem? -----Original Message----- From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users-admin at lists.sourceforge.net] On Behalf Of Lennard bakker Sent: Tuesday, September 13, 2005 6:06 PM To: Izotov Igor Cc: nagios-users at lists.sourceforge.net Subject: Re: [Nagios-users] Question regarding geographically distributed monitoring On host A make 3 checks. D'a (active checks) D'b (passive checks, feed by host B) D'c (passive checks, feed by host C) For all 3 no notifications will be send. Now create an check_cluster D with 3 hosts (D'a, D'b and D'c). This cluster check will send a notification when 3 out of 3 will fail. Lennard Izotov Igor wrote: > Yes, passive check results are ok, but I can't understand how to combine > them. > > -----Original Message----- > From: nagios-users-admin at lists.sourceforge.net > [mailto:nagios-users-admin at lists.sourceforge.net] On Behalf Of Andreas > Ericsson > Sent: Tuesday, September 13, 2005 5:35 PM > To: nagios-users at lists.sourceforge.net > Subject: Re: [Nagios-users] Question regarding geographically distributed > monitoring > > Izotov Igor wrote: > >>Hello, everyone! >>I need to create the following monitoring process: >>A,B,C are monitoring servers, distributed all over the world, seeing each >>other. >>D is the monitored host. >>A is the "main" host, which sends notification. >>Notifications should be sent only in case when D is not responding to all > > of > >>them (A && B && C). >>Can it be done by means of nagios? >> > > > I think so, provided they all send passive check-results to each other. > ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From andrew at 2sheds.de Tue Sep 13 16:19:55 2005 From: andrew at 2sheds.de (Andrew Miehs) Date: Tue, 13 Sep 2005 16:19:55 +0200 Subject: Nagios_grapher Message-ID: Hi all, does anyone have v1.3 of Nagios_grapher running? For some reason, it seems only to be able to create the 'ping' graphs, and not the ones for load and users... Any ideas? Thanks Andrew ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From M_Mendez at Fairfax.ca Tue Sep 13 16:26:50 2005 From: M_Mendez at Fairfax.ca (Mendez, Marc) Date: Tue, 13 Sep 2005 10:26:50 -0400 Subject: No returned output from plugin Message-ID: <64BE348AEB2C2B43891F53ABA306975636C522@FFHTOREX01.ffhl.intr> When nagios executes the following command it produces a "no output returned from plugin". $USER1$/check_snmp_int.pl -H $HOSTADDRESS$ -C snmp -n "serial1" -w 160,140 -c 175,165 -t 15 -rk The weird part is it works ok if I execute the plugin manually and although the status information displays not output nagios still reports correct status changes when they occur. Any ideas? Marc -------------- next part -------------- An HTML attachment was scrubbed... URL: From ton.voon at altinity.com Tue Sep 13 16:37:46 2005 From: ton.voon at altinity.com (Ton Voon) Date: Tue, 13 Sep 2005 15:37:46 +0100 Subject: plugin 1.4 - check_ping In-Reply-To: References: Message-ID: <224FC35A-A467-40E6-8D5E-48297B0AF10A@altinity.com> On 13 Sep 2005, at 13:42, Marco Borsani wrote: > Hi all ! > > I am testing plugin 1.4 on my HP-UX 11.0 and Nagios 2 system , but > when I > run check_ping command (release 1.45) a receive a "Memory fault > (coredump)" > #> ./check_ping -H IPADDRESS -w 100,10% -c 200,20% > > Same command using check_ping release 1.11 works fine. > > Do you know why? Marco, I've run a ./configure of the 1.3.1 plugins and "no useable ping syntax" found. My guess is that you previously ran configure with -- with-ping-command specified on the command line. What ping command do you recommend using on HP/UX? I can add this into the configure to automatically find this for the upcoming 1.4.2 release. Ton http://www.altinity.com T: +44 (0)870 787 9243 F: +44 (0)845 280 1725 Skype: tonvoon The contents of this email and any files transmitted with it are confidential and intended solely for the use of the individuals to whom it is addressed. If you are not the intended recipient or have received this e-mail in error please notify the sender and destroy this e-mail immediately. Any unauthorised copying, disclosure or distribution of the material in this e-mail is strictly prohibited. -------------- next part -------------- An HTML attachment was scrubbed... URL: From m.borsani at it.net Tue Sep 13 16:51:12 2005 From: m.borsani at it.net (Marco Borsani) Date: Tue, 13 Sep 2005 16:51:12 +0200 Subject: R: plugin 1.4 - check_ping In-Reply-To: References: Message-ID: Ton, I check both config.h (1.3.1 & 1.4) . I am not sure to understand yor question about "the command specified in config.h", but I can tell you than in both files I see same command #define PING_COMMAND "/usr/sbin/ping %s -n %d" Running check_ping -v -v -v (??) I see little differences : 1.3.1 check_ping : was not set Could not parse argumentsUsage: check_ping -H -w ,%% -c ,%% [-p packets] [-t timeout] [-L] check_ping (-h | --help) for detailed help check_ping (-V | --version) for version information 1.4 check_ping : was not set check_ping: Could not parse arguments Usage: check_ping -H -w ,% -c ,% [-p packets] [-t timeout] [-L] [-4|-6] Regards. Marco -}-----Messaggio originale----- -}Da: Ton Voon [mailto:tonvoon at mac.com] -}Inviato: martedi 13 settembre 2005 16.27 -}A: Marco Borsani -}Cc: NAGIOS -}Oggetto: Re: [Nagios-users] plugin 1.4 - check_ping -}Priorita: Alta -} -} -}Marco, -} -}I've just tried this on a HP testdrive server and I get a failure -}too. Looks like the ping command has not been picked up in the -}configure options. -} -}Can you try your 1.3.1 plugins and let me know what the command -}specified in config.h is? You may be able to do a check_ping -v -v -v -}to see the ping command used. -} -}Ton -} -}On 13 Sep 2005, at 13:42, Marco Borsani wrote: -} -}> Hi all ! -}> -}> I am testing plugin 1.4 on my HP-UX 11.0 and Nagios 2 system , but -}> when I -}> run check_ping command (release 1.45) a receive a "Memory fault -}> (coredump)" -}> #> ./check_ping -H IPADDRESS -w 100,10% -c 200,20% -}> -}> Same command using check_ping release 1.11 works fine. -}> -}> Do you know why? -}> -}> Regards -}> -}> Marco Borsani -}> Unix & Monitoring System Administrator -}> Technical Operation -}> Tel. +39 010 4310115 -}> Fax +39 010 4327454 -}> E-mail: m.borsani at IT.net -}> -}> ITnet S.r.l. - Direzione e Coordinamento di WIND Telecomunicazioni -}> S.p.A. -}> Internet Service Provider -}> Sede legale: Via C.G.Viola, 48 - 00148 Roma -}> Dir. Centrale e Amministrativa: Via Pacinotti, 39 -}> 16151 Genova (Italy) -}> -}> http://www.it.net -}> mailto:info at IT.net -}> _______________________________________________________________ -}> Altre sedi ITnet: -}> MILANO tel.: +39 02 30114900 info-milano at IT.net -}> ROMA tel.: +39 06 83116707 info-roma at IT.net -}> _______________________________________________________________ -}> ITnet is associated to CIX (Commercial IP eXchange) and RIPE -}> ITnet is associated to AIIP (Associazione Italiana Internet Providers) -}> -}> -}> -}> -}> ------------------------------------------------------- -}> SF.Net email is Sponsored by the Better Software Conference & EXPO -}> September 19-22, 2005 * San Francisco, CA * Development Lifecycle -}> Practices -}> Agile & Plan-Driven Development * Managing Projects & Teams * -}> Testing & QA -}> Security * Process Improvement & Measurement * http://www.sqe.com/ -}> bsce5sf -}> _______________________________________________ -}> Nagios-users mailing list -}> Nagios-users at lists.sourceforge.net -}> https://lists.sourceforge.net/lists/listinfo/nagios-users -}> ::: Please include Nagios version, plugin version (-v) and OS when -}> reporting any issue. -}> ::: Messages without supporting info will risk being sent to /dev/null -}> -}> -}> This message has been scanned for viruses by MailController - -}> www.MailController.altohiway.com -}> -} ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From m.borsani at it.net Tue Sep 13 16:53:15 2005 From: m.borsani at it.net (Marco Borsani) Date: Tue, 13 Sep 2005 16:53:15 +0200 Subject: R: plugin 1.4 - check_ping In-Reply-To: <224FC35A-A467-40E6-8D5E-48297B0AF10A@altinity.com> References: <224FC35A-A467-40E6-8D5E-48297B0AF10A@altinity.com> Message-ID: I did not run configure with --with-ping-command specified on the command line... I just ran ./configure I have no problem with standard 1.3.1 check_ping plugin (that point to /usr/sbin/ping command .. like 1.4 check_ping plugin) Regards Marco -----Messaggio originale----- Da: Ton Voon [mailto:ton.voon at altinity.com] Inviato: martedi 13 settembre 2005 16.38 A: Marco Borsani Cc: NAGIOS Oggetto: Re: [Nagios-users] plugin 1.4 - check_ping Priorita: Alta On 13 Sep 2005, at 13:42, Marco Borsani wrote: Hi all ! I am testing plugin 1.4 on my HP-UX 11.0 and Nagios 2 system , but when I run check_ping command (release 1.45) a receive a "Memory fault(coredump)" #> ./check_ping -H IPADDRESS -w 100,10% -c 200,20% Same command using check_ping release 1.11 works fine. Do you know why? Marco, I've run a ./configure of the 1.3.1 plugins and "no useable ping syntax" found. My guess is that you previously ran configure with --with-ping-command specified on the command line. What ping command do you recommend using on HP/UX? I can add this into the configure to automatically find this for the upcoming 1.4.2 release. Ton http://www.altinity.com T: +44 (0)870 787 9243 F: +44 (0)845 280 1725 Skype: tonvoon The contents of this email and any files transmitted with it are confidential and intended solely for the use of the individuals to whom it is addressed. If you are not the intended recipient or have received this e-mail in error please notify the sender and destroy this e-mail immediately. Any unauthorised copying, disclosure or distribution of the material in this e-mail is strictly prohibited. -------------- next part -------------- An HTML attachment was scrubbed... URL: From f1216 at yahoo.com Tue Sep 13 16:52:49 2005 From: f1216 at yahoo.com (Fred) Date: Tue, 13 Sep 2005 07:52:49 -0700 (PDT) Subject: No returned output from plugin In-Reply-To: <64BE348AEB2C2B43891F53ABA306975636C522@FFHTOREX01.ffhl.intr> References: <64BE348AEB2C2B43891F53ABA306975636C522@FFHTOREX01.ffhl.intr> Message-ID: <20050913145249.36422.qmail@web31908.mail.mud.yahoo.com> not sure how you are starting up nagios, but when I have these kinds of problems I start nagios by hand as user "nagios" (which is where mine runs) and then invoke the plug-in by rescheduling the check for right now via the web. Then watch the output from the demon going to stdout/stderr. You might also take the additional step of running strace -p"nagios-pid" -f -s512 in the background and redirecting its output so you can get an idea of what nagios is doing when it runs your plug-in. -FredC --- "Mendez, Marc" wrote: > When nagios executes the following command it produces a "no output > returned from plugin". > > > > $USER1$/check_snmp_int.pl -H $HOSTADDRESS$ -C snmp -n "serial1" -w > 160,140 -c 175,165 -t 15 -rk > > > > The weird part is it works ok if I execute the plugin manually and > although the status information displays not output nagios still reports > correct status changes when they occur. > > > > Any ideas? > > > > Marc > > > > > > ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From losilla at unizar.es Tue Sep 13 19:05:39 2005 From: losilla at unizar.es (Guillermo Los=?ISO-8859-1?B?aWxsYSBBbmFk824=?=) Date: Tue, 13 Sep 2005 19:05:39 +0200 Subject: is Nagios what I need? Message-ID: <1126631139.432706e31ce09@webmail.unizar.es> Dear all, I work in a small institution belonging to a bigger organism which manages and monitors the overall network and connects it to the Internet. We suspect this organism is controlling/priorizing our network traffic since we have detected strange behaviours (non-simetric bandwidths, timeouts, slower bandwidh than expected...) when accesing from/to the Internet with different services/protocols. We are looking for a network monitoring tool which is able to confirm that some network traffic control mechanism is being performed between our gateway and the Internet. My question is simple: is Nagios that tool? Is there any plugin giving this functionality? If not, could you tell me which tool have the feature we look for? Thanks, Guillermo ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From andrew at profitability.net Tue Sep 13 19:10:09 2005 From: andrew at profitability.net (Andrew Cruse) Date: Tue, 13 Sep 2005 13:10:09 -0400 Subject: is Nagios what I need? In-Reply-To: <1126631139.432706e31ce09@webmail.unizar.es> References: <1126631139.432706e31ce09@webmail.unizar.es> Message-ID: nagios-users-admin at lists.sourceforge.net wrote: > Dear all, > I work in a small institution belonging to a bigger > organism which manages and monitors the overall network and connects > it to the Internet. We suspect this organism is > controlling/priorizing our network traffic since we have detected > strange behaviours (non-simetric bandwidths, timeouts, slower > bandwidh than expected...) when accesing from/to the Internet with > different services/protocols. We are looking for a network > monitoring tool which is able to confirm that some network traffic > control mechanism is being performed between our gateway and the > Internet. My question is simple: is Nagios that tool? Is there any > plugin giving this functionality? If not, could you tell me which > tool have the feature we look for? Nagios probably isn't going to be what you're looking for. You'd do better having a look at PCHAR or TTCP. Andrew ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From noyler at khimetrics.com Tue Sep 13 19:56:33 2005 From: noyler at khimetrics.com (Nathan Oyler) Date: Tue, 13 Sep 2005 10:56:33 -0700 Subject: Nagios_grapher Message-ID: <59B15593F41BD24591D59436E7226EAD0279A229@Khiphx2.khimetrics.com> It only creates graphs for ping? Do you define ngraph's for your other service names? What is the ngraph statement for Load? > -----Original Message----- > From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users- > admin at lists.sourceforge.net] On Behalf Of Andrew Miehs > Sent: Tuesday, September 13, 2005 7:20 AM > To: nagios-users at lists.sourceforge.net > Subject: [Nagios-users] Nagios_grapher > > Hi all, > > does anyone have v1.3 of Nagios_grapher running? > > For some reason, it seems only to be able to create the 'ping' > graphs, and not the ones for load and users... > > Any ideas? > > Thanks > > Andrew > > > ------------------------------------------------------- > SF.Net email is Sponsored by the Better Software Conference & EXPO > September 19-22, 2005 * San Francisco, CA * Development Lifecycle > Practices > Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA > Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From dist-list at LEXUM.UMontreal.CA Tue Sep 13 23:01:20 2005 From: dist-list at LEXUM.UMontreal.CA (FM) Date: Tue, 13 Sep 2005 17:01:20 -0400 Subject: advice to check WAN connection Message-ID: <43273E20.2050904@lexum.umontreal.ca> Hello, I created : # 'check_internet' command definition define command{ command_name check_internet command_line $USER1$/check_http -H www.google.com define host{ use generic-srv ; Name of host template to use host_name Internet connection address 64.233.167.99 check_command check_internet max_check_attempts 10 notification_interval 120 notification_period 24x7 notification_options d,u,r contact_groups administrators } now I want that the statusmap looks like : nagios --FIREWALL -- SERVEUR | Internet Connection nagios--Firewall--serveur is already configured Thanks !!! ------------------------------------------------------- SF.Net email is sponsored by: Tame your development challenges with Apache's Geronimo App Server. Download it for free - -and be entered to win a 42" plasma tv or your very own Sony(tm)PSP. Click here to play: http://sourceforge.net/geronimo.php _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From sghosh at sghosh.org Tue Sep 13 23:40:03 2005 From: sghosh at sghosh.org (Subhendu Ghosh) Date: Tue, 13 Sep 2005 17:40:03 -0400 (EDT) Subject: advice to check WAN connection In-Reply-To: <43273E20.2050904@lexum.umontreal.ca> References: <43273E20.2050904@lexum.umontreal.ca> Message-ID: On Tue, 13 Sep 2005, FM wrote: > Hello, > I created : > # 'check_internet' command definition > define command{ > command_name check_internet > command_line $USER1$/check_http -H www.google.com > > define host{ > use generic-srv ; Name of host template to use > > host_name Internet connection > address 64.233.167.99 > check_command check_internet > max_check_attempts 10 > notification_interval 120 > notification_period 24x7 > notification_options d,u,r > contact_groups administrators > } > > > now I want that the statusmap looks like : > > nagios --FIREWALL -- SERVEUR > | > Internet Connection > > nagios--Firewall--serveur is already configured > > > Thanks !!! > Don't use google.com - it is not net friendly Check the interface of your upstream router. The above check also depends on your local dns resolver succeeding. -- -sg ------------------------------------------------------- SF.Net email is sponsored by: Tame your development challenges with Apache's Geronimo App Server. Download it for free - -and be entered to win a 42" plasma tv or your very own Sony(tm)PSP. Click here to play: http://sourceforge.net/geronimo.php _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From misc at viceconsulting.co.nz Tue Sep 13 23:39:30 2005 From: misc at viceconsulting.co.nz (misc at viceconsulting.co.nz) Date: Wed, 14 Sep 2005 09:39:30 +1200 (NZST) Subject: Nagios spawning rogue nagios processes - UPDATE Message-ID: <54111.127.0.0.1.1126647570.squirrel@www.goldenfields.co.nz> I posted a message a few days ago regarding Nagios spawning multiple rogue nagios processes eventually crashing the Nagios server. Basically it seems if Nagios can not process performance data fast enough it will keep spawning more and more nagios processes eventually crashing. It also seems Nagios can only invoke one perfdata-processing script at a time, and must wait for the first one to finish before the second one can start. I had Nagios invoking this dummy Perl script using the service_perfdata_command directive: #perl script sleep 30; This causes Nagios to go into a spin invoking more and more Nagios processes and eventually crashing the server. Memory and swap get chewed to 0. I solved the problem in this case by switching to using file-based perfdata via the following directives: service_perfdata_file_processing_command service_perfdata_file service_perfdata_file_processing_interval=30 Now the Nagios server is running a lot better (CPU was at 100% all the time, now its about 20%). If you are doing perfdata processing and you have more than a couple of hundred services consider switching to a file-based perfdata. Nonetheless I don't believe Nagios should behave as mentioned above irrespective of how you configure your perfdata processing. Cheers Alex ------------------------------------------------------- SF.Net email is sponsored by: Tame your development challenges with Apache's Geronimo App Server. Download it for free - -and be entered to win a 42" plasma tv or your very own Sony(tm)PSP. Click here to play: http://sourceforge.net/geronimo.php _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From pla at softflare.com Tue Sep 13 23:54:02 2005 From: pla at softflare.com (Paul L. Allen) Date: Tue, 13 Sep 2005 22:54:02 +0100 Subject: Question regarding geographically distributed monitoring In-Reply-To: References: Message-ID: <20050913215402.8905.qmail@mullet.softflare.net> Izotov Igor writes: > Clumsy solution, but it will work. Thank you. > And, perhaps anyone came across monitoring solutions that allow to do > it in a direct way to solve the problem? Given what he's doing, passive monitoring is a good idea to get around scalability problems anyway. But with passive monitoring he has the added advantage that if any of the passive monitors submit a good check result then his master nagios will say things are working correctly. Only if NONE of the slave monitors submit a passive check result of OK and he doesn't have staleness checking will things go wrong. For the pedants, if one of the remote monitors says things are good and another says things are bad then you're going to see flapping. There IS a problem but it's not where Nagios says it is. If you want perfect results then you'll have to hire many perfect human beings to monitor each server. If you want "something, somewhere is borked, and perhaps you'll need somebody smarter than the average point-and-click monkey to figure out where" then Nagios is adequate. If you want perfection then bge prepared to pay the cost. -- Paul Allen Softflare Support ------------------------------------------------------- SF.Net email is sponsored by: Tame your development challenges with Apache's Geronimo App Server. Download it for free - -and be entered to win a 42" plasma tv or your very own Sony(tm)PSP. Click here to play: http://sourceforge.net/geronimo.php _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From sghosh at sghosh.org Wed Sep 14 00:42:43 2005 From: sghosh at sghosh.org (Subhendu Ghosh) Date: Tue, 13 Sep 2005 18:42:43 -0400 (EDT) Subject: Question regarding geographically distributed monitoring In-Reply-To: <20050913215402.8905.qmail@mullet.softflare.net> References: <20050913215402.8905.qmail@mullet.softflare.net> Message-ID: >If you want "something, somewhere is borked, and perhaps > you'll need somebody smarter than the average point-and-click monkey > to figure out where" then Nagios is adequate. I think this is a keeper :) -- -sg ------------------------------------------------------- SF.Net email is sponsored by: Tame your development challenges with Apache's Geronimo App Server. Download it for free - -and be entered to win a 42" plasma tv or your very own Sony(tm)PSP. Click here to play: http://sourceforge.net/geronimo.php _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From potus98 at yahoo.com Wed Sep 14 02:27:23 2005 From: potus98 at yahoo.com (John Christian) Date: Tue, 13 Sep 2005 17:27:23 -0700 (PDT) Subject: perfparse doesn't display hosts, services Message-ID: <20050914002723.90283.qmail@web54706.mail.yahoo.com> Hi Gurus, I'm trying to integrate PerfParse with Nagios. When I access the PerfData Graphs page, I get the PerfParse logo, but no hosts or services are listed. I read that these fields should be auto-populated if the performance data is being delivered correctly. As a result, I suspect the performance data is not making it all the way to PerfParse. Nagios is logging performance data when I point it to a file. But when I connect Nagios to PerfParse via a pipe, I don't know how to check that the performance data is going into the pipe or arriving in the MySQL database. How do I "look" at the pipe? Or view the contents of the 'perfparse' database in MySQL? Any suggestions on what may be broken or how to continue diagnosing the problem? TIA! -John Additional Info: Nagios 2.04b PerfParse v0.105.6 SunOS 5.9 Generic_112233-12 Sun-Fire-880 It seems PerfParse can connect to the MySQL database: /usr/local/nagios/bin/check_perfparse_version OK Perfparse Database Version Correct: 0.19. | pp_ver=0.19 true_ver=0.19 I start perfparsed first and it creates a pipe in /usr/local/nagios/var: prw-r----- 1 root other 0 Sep 13 12:08 perfdata-service.log ...Then I start nagios and it seems to run fine, perform checks, and provide the normal web interface. Relevent entries from nagios.cfg: cfg_file=/usr/local/nagios/etc/nagios_perfparse.cfg nagios_user=nagios nagios_group=nagios perfdata_timeout=5 process_performance_data=1 host_perfdata_command=process-host-perfdata service_perfdata_command=process-service-perfdata service_perfdata_file=/usr/local/nagios/var/perfdata-service.log host_perfdata_file_template=[HOSTPERFDATA]\t$TIMET$\t$HOSTNAME$\t$HOSTEXECUTIONTIME$\t$HOSTOUTPUT$\t$HOSTPERFDATA$ service_perfdata_file_template=[SERVICEPERFDATA]\t$TIMET$\t$HOSTNAME$\t$SERVICEDESC$\t$SERVICEEXECUTIONTIME$\t$SERVICELATENCY$\t$SERVICEOUTPUT$\t$SERVICEPERFDATA$ #host_perfdata_file_mode=a #service_perfdata_file_mode=a #host_perfdata_file_processing_interval=0 #service_perfdata_file_processing_interval=0 #host_perfdata_file_processing_command=process-host-perfdata-file #service_perfdata_file_processing_command=process-service-perfdata-file Relevent entries from nagios_perfparse.cfg: define command{ command_name process-service-perfdata command_line /usr/local/nagios/bin/perfparse_nagios_pipe_command.pl "$TIMET$" "$HOSTNAME$" "$SERVICEDESC$" "$OUTPUT$" "$SERVICESTATE$" "$PERFDATA$" } define command{ command_name process-host-perfdata command_line /usr/local/nagios/bin/perfparse_nagios_pipe_command.pl "$TIMET$" "$HOSTNAME$" "$OUTPUT$" "$PERFDATA$" } Relevent entries from perfparse.cfg: Service_Log = "|/usr/local/nagios/var/perfdata-service.log" Host_Log = "|/usr/local/nagios/var/perfdata-host.log" Storage_Modules_Load = "mysql" __________________________________________________ Do You Yahoo!? Tired of spam? Yahoo! Mail has the best spam protection around http://mail.yahoo.com ------------------------------------------------------- SF.Net email is sponsored by: Tame your development challenges with Apache's Geronimo App Server. Download it for free - -and be entered to win a 42" plasma tv or your very own Sony(tm)PSP. Click here to play: http://sourceforge.net/geronimo.php _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From potus98 at yahoo.com Wed Sep 14 02:51:22 2005 From: potus98 at yahoo.com (John Christian) Date: Tue, 13 Sep 2005 17:51:22 -0700 (PDT) Subject: perfparse doesn't display hosts, services - UPDATE In-Reply-To: <20050914002723.90283.qmail@web54706.mail.yahoo.com> References: <20050914002723.90283.qmail@web54706.mail.yahoo.com> Message-ID: <20050914005122.89362.qmail@web54702.mail.yahoo.com> Oops. I should also have included this handy message from nagios.log: [1126658872] Nagios 2.0b4 starting... (PID=28393) [1126658872] LOG VERSION: 2.0 [1126658872] Finished daemonizing... (New PID=28394) [1126658872] Warning: File '/usr/local/nagios/var/perfdata-service.log' could not be opened - service performance data will not be written to file! --- John Christian wrote: > Hi Gurus, > > I'm trying to integrate PerfParse with Nagios. When > I > access the PerfData Graphs page, I get the PerfParse > logo, but no hosts or services are listed. I read > that > these fields should be auto-populated if the > performance data is being delivered correctly. As a > result, I suspect the performance data is not making > it all the way to PerfParse. > > Nagios is logging performance data when I point it > to > a file. But when I connect Nagios to PerfParse via a > pipe, I don't know how to check that the performance > data is going into the pipe or arriving in the MySQL > database. How do I "look" at the pipe? Or view the > contents of the 'perfparse' database in MySQL? > > Any suggestions on what may be broken or how to > continue diagnosing the problem? TIA! -John > > Additional Info: > > Nagios 2.04b > PerfParse v0.105.6 > SunOS 5.9 Generic_112233-12 Sun-Fire-880 > > It seems PerfParse can connect to the MySQL > database: > /usr/local/nagios/bin/check_perfparse_version > OK Perfparse Database Version Correct: 0.19. | > pp_ver=0.19 true_ver=0.19 > > I start perfparsed first and it creates a pipe in > /usr/local/nagios/var: > prw-r----- 1 root other 0 Sep 13 > 12:08 > perfdata-service.log > ...Then I start nagios and it seems to run fine, > perform checks, and provide the normal web > interface. > > > Relevent entries from nagios.cfg: > > cfg_file=/usr/local/nagios/etc/nagios_perfparse.cfg > nagios_user=nagios > nagios_group=nagios > perfdata_timeout=5 > process_performance_data=1 > host_perfdata_command=process-host-perfdata > service_perfdata_command=process-service-perfdata > service_perfdata_file=/usr/local/nagios/var/perfdata-service.log > host_perfdata_file_template=[HOSTPERFDATA]\t$TIMET$\t$HOSTNAME$\t$HOSTEXECUTIONTIME$\t$HOSTOUTPUT$\t$HOSTPERFDATA$ > service_perfdata_file_template=[SERVICEPERFDATA]\t$TIMET$\t$HOSTNAME$\t$SERVICEDESC$\t$SERVICEEXECUTIONTIME$\t$SERVICELATENCY$\t$SERVICEOUTPUT$\t$SERVICEPERFDATA$ > #host_perfdata_file_mode=a > #service_perfdata_file_mode=a > #host_perfdata_file_processing_interval=0 > #service_perfdata_file_processing_interval=0 > #host_perfdata_file_processing_command=process-host-perfdata-file > #service_perfdata_file_processing_command=process-service-perfdata-file > > > Relevent entries from nagios_perfparse.cfg: > > define command{ > command_name process-service-perfdata > command_line > /usr/local/nagios/bin/perfparse_nagios_pipe_command.pl > "$TIMET$" "$HOSTNAME$" "$SERVICEDESC$" "$OUTPUT$" > "$SERVICESTATE$" "$PERFDATA$" > } > > define command{ > command_name process-host-perfdata > command_line > /usr/local/nagios/bin/perfparse_nagios_pipe_command.pl > "$TIMET$" "$HOSTNAME$" "$OUTPUT$" "$PERFDATA$" > } > > > Relevent entries from perfparse.cfg: > > Service_Log = > "|/usr/local/nagios/var/perfdata-service.log" > Host_Log = > "|/usr/local/nagios/var/perfdata-host.log" > Storage_Modules_Load = "mysql" > > > > __________________________________________________ > Do You Yahoo!? > Tired of spam? Yahoo! Mail has the best spam > protection around > http://mail.yahoo.com > > > ------------------------------------------------------- > SF.Net email is sponsored by: > Tame your development challenges with Apache's > Geronimo App Server. > Download it for free - -and be entered to win a 42" > plasma tv or your very > own Sony(tm)PSP. Click here to play: > http://sourceforge.net/geronimo.php > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version > (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being > sent to /dev/null > __________________________________ Yahoo! Mail - PC Magazine Editors' Choice 2005 http://mail.yahoo.com ------------------------------------------------------- SF.Net email is sponsored by: Tame your development challenges with Apache's Geronimo App Server. Download it for free - -and be entered to win a 42" plasma tv or your very own Sony(tm)PSP. Click here to play: http://sourceforge.net/geronimo.php _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From charskall at gmail.com Wed Sep 14 02:35:45 2005 From: charskall at gmail.com (Char Skall) Date: Tue, 13 Sep 2005 20:35:45 -0400 Subject: odd permission problem Message-ID: <5506e8e405091317356adba7ef@mail.gmail.com> I have Nagios running fine. All of my orginal settings work fine and everything is being checked like i want. I can make adjustments to services.cfg and other cfg files and restart nagios without issue. Until I added a new host. I created thenew host and added it to the appropriate host group. The new host appears perfectly in the scheduling que and even shows its status as "ok". The problem is it will not appear in host group overview and when I click on it I get the "It appears as though you do not have permission to view information for this host...". I am logging in as the only user thats been entered "nagiosadmin" and have set that user for all of the access permissions in cgi.cfg . I also made sure the nagiosadmin is listed as the contact for all host groups. I can access everything else without problems just this one newly added host. -------------- next part -------------- An HTML attachment was scrubbed... URL: From drew at gothambus.com Wed Sep 14 05:08:12 2005 From: drew at gothambus.com (Drew Linsalata) Date: Tue, 13 Sep 2005 23:08:12 -0400 Subject: Compile problem - 2.0b4 on FreeBSD 5.1 Message-ID: <4327941C.4070601@gothambus.com> After a pretty simple "configure" run, we're running into compile issues with 2.0b4 on a FreeBSD 5.1 box. A "make all" results in: sky1# make all cd ./base && make gcc -g -O2 -DHAVE_CONFIG_H -DNSCORE -c broker.c In file included from ../include/config.h:114, from broker.c:26: /usr/include/sys/resource.h:61: field `ru_utime' has incomplete type /usr/include/sys/resource.h:62: field `ru_stime' has incomplete type /usr/include/sys/resource.h:79: confused by earlier errors, bailing out *** Error code 1 Stop in /usr/local/src/nagios-2.0b4/base. *** Error code 1 Stop in /usr/local/src/nagios-2.0b4. We don't have the GD/PNG/JPEG stuff on that box yet, but I dont see where that would cause compile errors. Has anyone seen this before? -- Drew Linsalata The Gotham Bus Company, Inc. Dedicated Servers and Colocation Solutions Long Island, New York http://www.gothambus.com ------------------------------------------------------- SF.Net email is sponsored by: Tame your development challenges with Apache's Geronimo App Server. Download it for free - -and be entered to win a 42" plasma tv or your very own Sony(tm)PSP. Click here to play: http://sourceforge.net/geronimo.php _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jjk_saji at yahoo.com Wed Sep 14 07:39:08 2005 From: jjk_saji at yahoo.com (John Joseph) Date: Wed, 14 Sep 2005 06:39:08 +0100 (BST) Subject: No notifications in "nagios.log" , In-Reply-To: <432684DA.70606@lamp.xs4all.nl> References: <432684DA.70606@lamp.xs4all.nl> Message-ID: <20050914053908.54136.qmail@web40824.mail.yahoo.com> Hi Thanks to all the members who had given we advice , I also got help from http://meulie.net/forum_viewtopic.php?21.3339 I had to enable notification , which I did from nagios interface [ Previously I was trying out with oreon ] and I do not know what was the reason for not working that time please check the link http://nagios.sourceforge.net/docs/1_0/xodtemplate.html#retention_notes Thanks Joseph John --- Lennard Bakker wrote: > -----BEGIN PGP SIGNED MESSAGE----- > Hash: SHA1 > > John Joseph wrote: > > Hi > > When I was trouble shooting for the not > receiving > > email -notification , I found that my > > ? usr/local/nagios/var/nagios.log? do not have > any > > notification in it , while it has other details > such > > as history , status , trends > > > I think if I am able to find out the > reason > > , why there is no info of notification in > nagios.log , > > I can solve my mail problem > > Did you find what the problem was. I have the same > problem here. It > looks like Nagios isn't trying to send > notifications. Non will appear in > the log file of nagios. As of nagios doesn't need > to send a message. > > All messages are enabled, all time stamps are 24x7.. > > Lennard > > -----BEGIN PGP SIGNATURE----- > Version: GnuPG v1.4.0 (MingW32) > Comment: Using GnuPG with Thunderbird - > http://enigmail.mozdev.org > > iD8DBQFDJoTaB3IFhTJpAVkRAqcLAJ4kZj9nTMV2Pqtx9qkT9unpXzaPFwCg7SvO > bzVCOT5lZX6zWi7e2tvQwRw= > =3L0N > -----END PGP SIGNATURE----- > ___________________________________________________________ Yahoo! Messenger - NEW crystal clear PC to PC calling worldwide with voicemail http://uk.messenger.yahoo.com ------------------------------------------------------- SF.Net email is sponsored by: Tame your development challenges with Apache's Geronimo App Server. Download it for free - -and be entered to win a 42" plasma tv or your very own Sony(tm)PSP. Click here to play: http://sourceforge.net/geronimo.php _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mohamed.azizi at belgacom.be Wed Sep 14 09:39:29 2005 From: mohamed.azizi at belgacom.be (mohamed.azizi at belgacom.be) Date: Wed, 14 Sep 2005 09:39:29 +0200 Subject: NRPE: unable to read output Message-ID: <5F3043372274524C967EB597009D2A0E022F0549@AE0008.BGC.NET> Dear friend , I have in Nagios for some checks with the following error :NRPE unable to read output How can I solve this problem Thanks Mohamed **** DISCLAIMER **** http://www.belgacom.be/maildisclaimer -------------- next part -------------- An HTML attachment was scrubbed... URL: From elizar.palad at gmail.com Wed Sep 14 10:36:04 2005 From: elizar.palad at gmail.com (Elizar M. Palad) Date: Wed, 14 Sep 2005 16:36:04 +0800 Subject: problem with check_disk_remote (solaris plugin) Message-ID: Hi everyone! Why is it that when i run the plugin manually, ie, commanline, the scripts executes without a problem.. example # ../libexec/check_disk_remote -e rsh -H k3tsyn1 -w 90 -c 98 OK: All Filesystems are below threshold (90/98%) | /=5%;;;0;100 /usr=59%;;;0;100 /var=30%;;;0;100 /opt=58%;;;0;100 But when i do this in my command.cfg file: define command{ command_name check_remote_disk command_line $USER1$/check_disk_remote -e rsh -H k3tsyn1 -w 90 -c 98 } (yes, i know that's the macro thing.. :) its not working also, i put it like that to simulate manual execution) the logs says: [1126686654] SERVICE ALERT: LTX k3tsyn1 ;Disk Usage;UNKNOWN;SOFT;1;(No output returned from plugin) [1126686714] SERVICE ALERT: LTX k3tsyn1 ;Disk Usage;UNKNOWN;SOFT;2;(No output returned from plugin) Thanks! -- ---- Don't Tell Me How Hard You Work.. Show Me How Much You'd Accomplished.. -------------- next part -------------- An HTML attachment was scrubbed... URL: From elizar.palad at gmail.com Wed Sep 14 10:40:49 2005 From: elizar.palad at gmail.com (Elizar M. Palad) Date: Wed, 14 Sep 2005 16:40:49 +0800 Subject: problem with check_disk_remote (solaris plugin) In-Reply-To: <79BF0B53C4A28446A097D6170CF4C69955ED3B@xw2k3-hammbx-03.ads.dlh.de> References: <79BF0B53C4A28446A097D6170CF4C69955ED3B@xw2k3-hammbx-03.ads.dlh.de> Message-ID: yes actually, no. I am root in the commandline.. ? On 9/14/05, oliver.steenbuck at lhsystems.com wrote: > > maybe your try from the commandline was not as the nagios user ? > > -----Urspr?ngliche Nachricht----- > *Von:* nagios-users-admin at lists.sourceforge.net [mailto: > nagios-users-admin at lists.sourceforge.net]*Im Auftrag von* Elizar M. Palad > *Gesendet am:* Mittwoch, 14. September 2005 10:36 > *An:* nagios-users at lists.sourceforge.net > *Betreff:* [Nagios-users] problem with check_disk_remote (solaris plugin) > Hi everyone! > Why is it that when i run the plugin manually, ie, commanline, > the scripts executes without a problem.. example > # ../libexec/check_disk_remote -e rsh -H k3tsyn1 -w 90 -c 98 > OK: All Filesystems are below threshold (90/98%) | /=5%;;;0;100 > /usr=59%;;;0;100 /var=30%;;;0;100 /opt=58%;;;0;100 > But when i do this in my command.cfg file: > define command{ > command_name check_remote_disk > command_line $USER1$/check_disk_remote -e rsh -H k3tsyn1 -w 90 -c 98 > } > (yes, i know that's the macro thing.. :) its not working also, i put it > like > that to simulate manual execution) > the logs says: > [1126686654] SERVICE ALERT: LTX k3tsyn1 ;Disk Usage;UNKNOWN;SOFT;1;(No > output returned from plugin) > [1126686714] SERVICE ALERT: LTX k3tsyn1 ;Disk Usage;UNKNOWN;SOFT;2;(No > output returned from plugin) > Thanks! > > -- > ---- > Don't Tell Me How Hard You Work.. > Show Me How Much You'd Accomplished.. > > -- ---- Don't Tell Me How Hard You Work.. Show Me How Much You'd Accomplished.. -------------- next part -------------- An HTML attachment was scrubbed... URL: From oliver.steenbuck at lhsystems.com Wed Sep 14 10:39:52 2005 From: oliver.steenbuck at lhsystems.com (oliver.steenbuck at lhsystems.com) Date: Wed, 14 Sep 2005 10:39:52 +0200 Subject: AW: problem with check_disk_remote (solaris plugin) Message-ID: <79BF0B53C4A28446A097D6170CF4C69955ED3B@xw2k3-hammbx-03.ads.dlh.de> maybe your try from the commandline was not as the nagios user ? -----Urspr?ngliche Nachricht----- Von: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users-admin at lists.sourceforge.net]Im Auftrag von Elizar M. Palad Gesendet am: Mittwoch, 14. September 2005 10:36 An: nagios-users at lists.sourceforge.net Betreff: [Nagios-users] problem with check_disk_remote (solaris plugin) Hi everyone! Why is it that when i run the plugin manually, ie, commanline, the scripts executes without a problem.. example # ../libexec/check_disk_remote -e rsh -H k3tsyn1 -w 90 -c 98 OK: All Filesystems are below threshold (90/98%) | /=5%;;;0;100 /usr=59%;;;0;100 /var=30%;;;0;100 /opt=58%;;;0;100 But when i do this in my command.cfg file: define command{ command_name check_remote_disk command_line $USER1$/check_disk_remote -e rsh -H k3tsyn1 -w 90 -c 98 } (yes, i know that's the macro thing.. :) its not working also, i put it like that to simulate manual execution) the logs says: [1126686654] SERVICE ALERT: LTX k3tsyn1 ;Disk Usage;UNKNOWN;SOFT;1;(No output returned from plugin) [1126686714] SERVICE ALERT: LTX k3tsyn1 ;Disk Usage;UNKNOWN;SOFT;2;(No output returned from plugin) Thanks! -- ---- Don't Tell Me How Hard You Work.. Show Me How Much You'd Accomplished.. -------------- next part -------------- An HTML attachment was scrubbed... URL: From oliver.steenbuck at lhsystems.com Wed Sep 14 10:44:17 2005 From: oliver.steenbuck at lhsystems.com (oliver.steenbuck at lhsystems.com) Date: Wed, 14 Sep 2005 10:44:17 +0200 Subject: AW: problem with check_disk_remote (solaris plugin) Message-ID: <79BF0B53C4A28446A097D6170CF4C69955ED3C@xw2k3-hammbx-03.ads.dlh.de> is the plugin also working if you are nagios and start it from the commandline ? -----Urspr?ngliche Nachricht----- Von: Elizar M. Palad [mailto:elizar.palad at gmail.com] Gesendet am: Mittwoch, 14. September 2005 10:41 An: STEENBUCK, OLIVER Cc: nagios-users at lists.sourceforge.net Betreff: Re: [Nagios-users] problem with check_disk_remote (solaris plugin) yes actually, no. I am root in the commandline.. ? On 9/14/05, oliver.steenbuck at lhsystems.com < oliver.steenbuck at lhsystems.com > wrote: maybe your try from the commandline was not as the nagios user ? -----Urspr?ngliche Nachricht----- Von: nagios-users-admin at lists.sourceforge.net [mailto: nagios-users-admin at lists.sourceforge.net ]Im Auftrag von Elizar M. Palad Gesendet am: Mittwoch, 14. September 2005 10:36 An: nagios-users at lists.sourceforge.net Betreff: [Nagios-users] problem with check_disk_remote (solaris plugin) Hi everyone! Why is it that when i run the plugin manually, ie, commanline, the scripts executes without a problem.. example # ../libexec/check_disk_remote -e rsh -H k3tsyn1 -w 90 -c 98 OK: All Filesystems are below threshold (90/98%) | /=5%;;;0;100 /usr=59%;;;0;100 /var=30%;;;0;100 /opt=58%;;;0;100 But when i do this in my command.cfg file: define command{ command_name check_remote_disk command_line $USER1$/check_disk_remote -e rsh -H k3tsyn1 -w 90 -c 98 } (yes, i know that's the macro thing.. :) its not working also, i put it like that to simulate manual execution) the logs says: [1126686654] SERVICE ALERT: LTX k3tsyn1 ;Disk Usage;UNKNOWN;SOFT;1;(No output returned from plugin) [1126686714] SERVICE ALERT: LTX k3tsyn1 ;Disk Usage;UNKNOWN;SOFT;2;(No output returned from plugin) Thanks! -- ---- Don't Tell Me How Hard You Work.. Show Me How Much You'd Accomplished.. -- ---- Don't Tell Me How Hard You Work.. Show Me How Much You'd Accomplished.. -------------- next part -------------- An HTML attachment was scrubbed... URL: From elizar.palad at gmail.com Wed Sep 14 10:46:47 2005 From: elizar.palad at gmail.com (Elizar M. Palad) Date: Wed, 14 Sep 2005 16:46:47 +0800 Subject: problem with check_disk_remote (solaris plugin) In-Reply-To: <79BF0B53C4A28446A097D6170CF4C69955ED3C@xw2k3-hammbx-03.ads.dlh.de> References: <79BF0B53C4A28446A097D6170CF4C69955ED3C@xw2k3-hammbx-03.ads.dlh.de> Message-ID: its not working! checked the ownhership/permission of the file, both ok nagios 755 but when executed by nagios user, i got: $ ../libexec/check_disk_remote -e rsh -H k3tsyn1 -w 20 -c 30 permission denied permission denied Use of uninitialized value in concatenation (.) or string at ../libexec/check_disk_remote line 119. | checking the file now.. but any inputs are welcome.. thanks! On 9/14/05, oliver.steenbuck at lhsystems.com wrote: > > is the plugin also working if you are nagios and start it from the > commandline ? > > -----Urspr?ngliche Nachricht----- > *Von:* Elizar M. Palad [mailto:elizar.palad at gmail.com] > *Gesendet am:* Mittwoch, 14. September 2005 10:41 > *An:* STEENBUCK, OLIVER > *Cc:* nagios-users at lists.sourceforge.net > *Betreff:* Re: [Nagios-users] problem with check_disk_remote (solaris > plugin) > yes actually, no. I am root in the commandline.. > ? > > > On 9/14/05, oliver.steenbuck at lhsystems.com > wrote: > > > > maybe your try from the commandline was not as the nagios user ? > > > > -----Urspr?ngliche Nachricht----- > > *Von:* nagios-users-admin at lists.sourceforge.net [mailto:nagios-users-admin at lists.sourceforge.net > > ]*Im Auftrag von* Elizar M. Palad > > *Gesendet am:* Mittwoch, 14. September 2005 10:36 > > *An:* nagios-users at lists.sourceforge.net > > *Betreff:* [Nagios-users] problem with check_disk_remote (solaris > > plugin) > > Hi everyone! > > Why is it that when i run the plugin manually, ie, commanline, > > the scripts executes without a problem.. example > > # ../libexec/check_disk_remote -e rsh -H k3tsyn1 -w 90 -c 98 > > OK: All Filesystems are below threshold (90/98%) | /=5%;;;0;100 > > /usr=59%;;;0;100 /var=30%;;;0;100 /opt=58%;;;0;100 > > But when i do this in my command.cfg file: > > define command{ > > command_name check_remote_disk > > command_line $USER1$/check_disk_remote -e rsh -H k3tsyn1 -w 90 -c 98 > > } > > (yes, i know that's the macro thing.. :) its not working also, i put it > > like > > that to simulate manual execution) > > the logs says: > > [1126686654] SERVICE ALERT: LTX k3tsyn1 ;Disk Usage;UNKNOWN;SOFT;1;(No > > output returned from plugin) > > [1126686714] SERVICE ALERT: LTX k3tsyn1 ;Disk Usage;UNKNOWN;SOFT;2;(No > > output returned from plugin) > > Thanks! > > > > -- > > ---- > > Don't Tell Me How Hard You Work.. > > Show Me How Much You'd Accomplished.. > > > > > > > -- > ---- > Don't Tell Me How Hard You Work.. > Show Me How Much You'd Accomplished.. > > -- ---- Don't Tell Me How Hard You Work.. Show Me How Much You'd Accomplished.. -------------- next part -------------- An HTML attachment was scrubbed... URL: From richard.gliebe at fhv.at Wed Sep 14 10:53:05 2005 From: richard.gliebe at fhv.at (Richard Gliebe) Date: Wed, 14 Sep 2005 10:53:05 +0200 Subject: Monitoring temperature Message-ID: <1126687986.841.16.camel@glr-nb.dh.uclv.net> Hi, I'm looking for a Plugin (check_*) to monitor the temperatures from our Cisco switches and routers via snmp. We are running Nagios 2.0b3 on FreeBSD 5.4-STABLE. thanks in advance. Richard -- Richard Gliebe Fachhochschule Vorarlberg GmbH / University for Applied Science Information Services Hochschulstra?e 1, A-6850 Dornbirn Telefon ++43 / (0)5572 / 20336-2207 E-Mail: richard.gliebe at fhv.at ------------------------------------------------------- SF.Net email is sponsored by: Tame your development challenges with Apache's Geronimo App Server. Download it for free - -and be entered to win a 42" plasma tv or your very own Sony(tm)PSP. Click here to play: http://sourceforge.net/geronimo.php _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From oliver.steenbuck at lhsystems.com Wed Sep 14 10:56:56 2005 From: oliver.steenbuck at lhsystems.com (oliver.steenbuck at lhsystems.com) Date: Wed, 14 Sep 2005 10:56:56 +0200 Subject: AW: problem with check_disk_remote (solaris plugin) Message-ID: <79BF0B53C4A28446A097D6170CF4C69955ED3D@xw2k3-hammbx-03.ads.dlh.de> quick googling turned showed taht some other persons who eported your proble had errors in their authentification process. Have you checked your rsh configuration ? -----Urspr?ngliche Nachricht----- Von: Elizar M. Palad [mailto:elizar.palad at gmail.com] Gesendet am: Mittwoch, 14. September 2005 10:47 An: STEENBUCK, OLIVER Cc: nagios-users at lists.sourceforge.net Betreff: Re: [Nagios-users] problem with check_disk_remote (solaris plugin) its not working! checked the ownhership/permission of the file, both ok nagios 755 but when executed by nagios user, i got: $ ../libexec/check_disk_remote -e rsh -H k3tsyn1 -w 20 -c 30 permission denied permission denied Use of uninitialized value in concatenation (.) or string at ../libexec/check_disk_remote line 119. | checking the file now.. but any inputs are welcome.. thanks! On 9/14/05, oliver.steenbuck at lhsystems.com < oliver.steenbuck at lhsystems.com > wrote: is the plugin also working if you are nagios and start it from the commandline ? -----Urspr?ngliche Nachricht----- Von: Elizar M. Palad [mailto: elizar.palad at gmail.com] Gesendet am: Mittwoch, 14. September 2005 10:41 An: STEENBUCK, OLIVER Cc: nagios-users at lists.sourceforge.net Betreff: Re: [Nagios-users] problem with check_disk_remote (solaris plugin) yes actually, no. I am root in the commandline.. ? On 9/14/05, oliver.steenbuck at lhsystems.com < oliver.steenbuck at lhsystems.com > wrote: maybe your try from the commandline was not as the nagios user ? -----Urspr?ngliche Nachricht----- Von: nagios-users-admin at lists.sourceforge.net [mailto: nagios-users-admin at lists.sourceforge.net ]Im Auftrag von Elizar M. Palad Gesendet am: Mittwoch, 14. September 2005 10:36 An: nagios-users at lists.sourceforge.net Betreff: [Nagios-users] problem with check_disk_remote (solaris plugin) Hi everyone! Why is it that when i run the plugin manually, ie, commanline, the scripts executes without a problem.. example # ../libexec/check_disk_remote -e rsh -H k3tsyn1 -w 90 -c 98 OK: All Filesystems are below threshold (90/98%) | /=5%;;;0;100 /usr=59%;;;0;100 /var=30%;;;0;100 /opt=58%;;;0;100 But when i do this in my command.cfg file: define command{ command_name check_remote_disk command_line $USER1$/check_disk_remote -e rsh -H k3tsyn1 -w 90 -c 98 } (yes, i know that's the macro thing.. :) its not working also, i put it like that to simulate manual execution) the logs says: [1126686654] SERVICE ALERT: LTX k3tsyn1 ;Disk Usage;UNKNOWN;SOFT;1;(No output returned from plugin) [1126686714] SERVICE ALERT: LTX k3tsyn1 ;Disk Usage;UNKNOWN;SOFT;2;(No output returned from plugin) Thanks! -- ---- Don't Tell Me How Hard You Work.. Show Me How Much You'd Accomplished.. -- ---- Don't Tell Me How Hard You Work.. Show Me How Much You'd Accomplished.. -- ---- Don't Tell Me How Hard You Work.. Show Me How Much You'd Accomplished.. -------------- next part -------------- An HTML attachment was scrubbed... URL: From robmossrm at aol.com Wed Sep 14 10:59:56 2005 From: robmossrm at aol.com (Rob Moss) Date: Wed, 14 Sep 2005 09:59:56 +0100 Subject: NRPE: unable to read output In-Reply-To: <5F3043372274524C967EB597009D2A0E022F0549@AE0008.BGC.NET> References: <5F3043372274524C967EB597009D2A0E022F0549@AE0008.BGC.NET> Message-ID: <4327E68C.2030904@aol.com> mohamed.azizi at belgacom.be wrote: > > Dear friend , > > I have in Nagios for some checks with the following error :NRPE > unable to read output > How can I solve this problem > > Thanks > > Mohamed > You could start by giving us some information about your setup.. O/S, Nagios version, NRPE version, location of NRPE on the client system etc. Double check the settings in your nrpe.cfg file to ensure that the paths to the check executables are in the right location, say /usr/local/nagios/libexec/check_* etc. rob. -------------- next part -------------- An HTML attachment was scrubbed... URL: From misch at multinet.de Wed Sep 14 11:06:54 2005 From: misch at multinet.de (Michael Schwartzkopff) Date: Wed, 14 Sep 2005 11:06:54 +0200 Subject: Monitoring temperature In-Reply-To: <1126687986.841.16.camel@glr-nb.dh.uclv.net> References: <1126687986.841.16.camel@glr-nb.dh.uclv.net> Message-ID: <200509141106.57078.misch@multinet.de> Am Mittwoch, 14. September 2005 10:53 schrieb Richard Gliebe: > Hi, > > I'm looking for a Plugin (check_*) to monitor the temperatures from our > Cisco switches and routers via snmp. > > We are running Nagios 2.0b3 on FreeBSD 5.4-STABLE. > > thanks in advance. > > Richard Hi, SNMP does it! Look into the CISCO chassis MIBs. A good point to search is www.mibdepot.org. From nagios you can access these values with check_snmp. For me it works great! -- Dr. Michael Schwartzkopff MultiNET Services GmbH Bretonischer Ring 7 85630 Grasbrunn Tel: (+49 89) 456 911 - 0 Fax: (+49 89) 456 911 - 21 mob: (+49 174) 343 28 75 PGP Fingerprint: F919 3919 FF12 ED5A 2801 DEA6 AA77 57A4 EDD8 979B Skype: misch42 -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: application/pgp-signature Size: 189 bytes Desc: not available URL: From savage at savage.za.org Wed Sep 14 11:26:08 2005 From: savage at savage.za.org (Chris Knipe) Date: Wed, 14 Sep 2005 11:26:08 +0200 Subject: notification issues Message-ID: <029601c5b90e$5826b6a0$0a02a8c0@MEGADROID> Hi, I monitor arround 500 services via NRPE2. Up to now, nevermind what I do, Nagios sends a notification everytime NRPE's OUTPUT changes, and not when then STATE changes.... Can anyone please give me some pointers as to what could be causing this???? Thanks, Chris. ------------------------------------------------------- SF.Net email is sponsored by: Tame your development challenges with Apache's Geronimo App Server. Download it for free - -and be entered to win a 42" plasma tv or your very own Sony(tm)PSP. Click here to play: http://sourceforge.net/geronimo.php _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From zhecka at metropol.ru Wed Sep 14 11:28:52 2005 From: zhecka at metropol.ru (Kaltashkin Eugene) Date: Wed, 14 Sep 2005 13:28:52 +0400 Subject: Monitoring temperature Message-ID: <391170E1931DAC4495B21D8069FFFE36505BDA@MX.metropol.ru> www.mibdepot.org link is broken -- Best Regards Kaltashkin Eugene ZHECKA-RIPN > -----Original Message----- > From: nagios-users-admin at lists.sourceforge.net > [mailto:nagios-users-admin at lists.sourceforge.net] On Behalf > Of Michael Schwartzkopff > Sent: Wednesday, September 14, 2005 1:07 PM > To: nagios-users at lists.sourceforge.net > Subject: Re: [Nagios-users] Monitoring temperature > > Am Mittwoch, 14. September 2005 10:53 schrieb Richard Gliebe: > > Hi, > > > > I'm looking for a Plugin (check_*) to monitor the > temperatures from our > > Cisco switches and routers via snmp. > > > > We are running Nagios 2.0b3 on FreeBSD 5.4-STABLE. > > > > thanks in advance. > > > > Richard > > Hi, > > SNMP does it! > > Look into the CISCO chassis MIBs. A good point to search is > www.mibdepot.org. > From nagios you can access these values with check_snmp. For > me it works > great! > > -- > Dr. Michael Schwartzkopff > MultiNET Services GmbH > Bretonischer Ring 7 > 85630 Grasbrunn > > Tel: (+49 89) 456 911 - 0 > Fax: (+49 89) 456 911 - 21 > mob: (+49 174) 343 28 75 > > PGP Fingerprint: F919 3919 FF12 ED5A 2801 DEA6 AA77 57A4 EDD8 979B > Skype: misch42 > ------------------------------------------------------- SF.Net email is sponsored by: Tame your development challenges with Apache's Geronimo App Server. Download it for free - -and be entered to win a 42" plasma tv or your very own Sony(tm)PSP. Click here to play: http://sourceforge.net/geronimo.php _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From richard.gliebe at fhv.at Wed Sep 14 11:37:12 2005 From: richard.gliebe at fhv.at (Richard Gliebe) Date: Wed, 14 Sep 2005 11:37:12 +0200 Subject: Monitoring temperature In-Reply-To: <200509141106.57078.misch@multinet.de> References: <1126687986.841.16.camel@glr-nb.dh.uclv.net> <200509141106.57078.misch@multinet.de> Message-ID: <1126690632.841.20.camel@glr-nb.dh.uclv.net> On Wed, 2005-09-14 at 11:06 +0200, Michael Schwartzkopff wrote: Hi, > Am Mittwoch, 14. September 2005 10:53 schrieb Richard Gliebe: > > Hi, > > > > I'm looking for a Plugin (check_*) to monitor the temperatures from our > > Cisco switches and routers via snmp. > > > > We are running Nagios 2.0b3 on FreeBSD 5.4-STABLE. > > > > thanks in advance. > > > > Richard > > Hi, > > SNMP does it! > > Look into the CISCO chassis MIBs. A good point to search is www.mibdepot.org. > From nagios you can access these values with check_snmp. For me it works > great! thanks for the answer, but the link http://www.mibdepot.org is broken. But http://www.mibdepot.com works. Do you have an example for me, because I'm really new to snmp and MIBs. Thanks Richard -- Richard Gliebe Fachhochschule Vorarlberg GmbH / University for Applied Science Information Services Hochschulstra?e 1, A-6850 Dornbirn Telefon ++43 / (0)5572 / 20336-2207 E-Mail: richard.gliebe at fhv.at ------------------------------------------------------- SF.Net email is sponsored by: Tame your development challenges with Apache's Geronimo App Server. Download it for free - -and be entered to win a 42" plasma tv or your very own Sony(tm)PSP. Click here to play: http://sourceforge.net/geronimo.php _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From elizar.palad at gmail.com Wed Sep 14 11:14:16 2005 From: elizar.palad at gmail.com (Elizar M. Palad) Date: Wed, 14 Sep 2005 17:14:16 +0800 Subject: problem with check_disk_remote (solaris plugin) In-Reply-To: <79BF0B53C4A28446A097D6170CF4C69955ED3D@xw2k3-hammbx-03.ads.dlh.de> References: <79BF0B53C4A28446A097D6170CF4C69955ED3D@xw2k3-hammbx-03.ads.dlh.de> Message-ID: Thanks for the idea! I think i got right.. my root acct never prompts for passwd when doing rlogin/rsh (.rhosts) what i did was, created the nagios user on the remote pc and touched .rhosts in its home directory, putting the nagios server in it. i think that did it.. :) will continue testing.. ill be back here soon! Thanks plenty! regards. elizar.palad www.razile.com On 9/14/05, oliver.steenbuck at lhsystems.com wrote: > > quick googling turned showed taht some other persons who eported your > proble had errors in their authentification process. Have you checked your > rsh configuration ? > > -----Urspr?ngliche Nachricht----- > *Von:* Elizar M. Palad [mailto:elizar.palad at gmail.com] > *Gesendet am:* Mittwoch, 14. September 2005 10:47 > *An:* STEENBUCK, OLIVER > *Cc:* nagios-users at lists.sourceforge.net > *Betreff:* Re: [Nagios-users] problem with check_disk_remote (solaris > plugin) > its not working! checked the ownhership/permission of the file, both ok > nagios 755 > but when executed by nagios user, i got: > $ ../libexec/check_disk_remote -e rsh -H k3tsyn1 -w 20 -c 30 > permission denied > permission denied > Use of uninitialized value in concatenation (.) or string at > ../libexec/check_disk_remote line 119. > | > checking the file now.. but any inputs are welcome.. > thanks! > > > On 9/14/05, oliver.steenbuck at lhsystems.com > wrote: > > > > is the plugin also working if you are nagios and start it from the > > commandline ? > > > > -----Urspr?ngliche Nachricht----- > > *Von:* Elizar M. Palad [mailto: elizar.palad at gmail.com] > > *Gesendet am:* Mittwoch, 14. September 2005 10:41 > > *An:* STEENBUCK, OLIVER > > *Cc:* nagios-users at lists.sourceforge.net > > *Betreff:* Re: [Nagios-users] problem with check_disk_remote (solaris > > plugin) > > yes actually, no. I am root in the commandline.. > > ? > > > > > > On 9/14/05, oliver.steenbuck at lhsystems.com > > wrote: > > > > > > maybe your try from the commandline was not as the nagios user ? > > > > > > -----Urspr?ngliche Nachricht----- > > > *Von:* nagios-users-admin at lists.sourceforge.net [mailto:nagios-users-admin at lists.sourceforge.net > > > ]*Im Auftrag von* Elizar M. Palad > > > *Gesendet am:* Mittwoch, 14. September 2005 10:36 > > > *An:* nagios-users at lists.sourceforge.net > > > *Betreff:* [Nagios-users] problem with check_disk_remote (solaris > > > plugin) > > > Hi everyone! > > > Why is it that when i run the plugin manually, ie, commanline, > > > the scripts executes without a problem.. example > > > # ../libexec/check_disk_remote -e rsh -H k3tsyn1 -w 90 -c 98 > > > OK: All Filesystems are below threshold (90/98%) | /=5%;;;0;100 > > > /usr=59%;;;0;100 /var=30%;;;0;100 /opt=58%;;;0;100 > > > But when i do this in my command.cfg file: > > > define command{ > > > command_name check_remote_disk > > > command_line $USER1$/check_disk_remote -e rsh -H k3tsyn1 -w 90 -c 98 > > > } > > > (yes, i know that's the macro thing.. :) its not working also, i put > > > it like > > > that to simulate manual execution) > > > the logs says: > > > [1126686654] SERVICE ALERT: LTX k3tsyn1 ;Disk Usage;UNKNOWN;SOFT;1;(No > > > output returned from plugin) > > > [1126686714] SERVICE ALERT: LTX k3tsyn1 ;Disk Usage;UNKNOWN;SOFT;2;(No > > > output returned from plugin) > > > Thanks! > > > > > > -- > > > ---- > > > Don't Tell Me How Hard You Work.. > > > Show Me How Much You'd Accomplished.. > > > > > > > > > > > > -- > > ---- > > Don't Tell Me How Hard You Work.. > > Show Me How Much You'd Accomplished.. > > > > > > > -- > ---- > Don't Tell Me How Hard You Work.. > Show Me How Much You'd Accomplished.. > > -- ---- Don't Tell Me How Hard You Work.. Show Me How Much You'd Accomplished.. -------------- next part -------------- An HTML attachment was scrubbed... URL: From ae at op5.se Wed Sep 14 13:26:57 2005 From: ae at op5.se (Andreas Ericsson) Date: Wed, 14 Sep 2005 13:26:57 +0200 Subject: perfparse doesn't display hosts, services In-Reply-To: <20050914002723.90283.qmail@web54706.mail.yahoo.com> References: <20050914002723.90283.qmail@web54706.mail.yahoo.com> Message-ID: <43280901.9000806@op5.se> John Christian wrote: > Hi Gurus, > > I'm trying to integrate PerfParse with Nagios. When I > access the PerfData Graphs page, I get the PerfParse > logo, but no hosts or services are listed. I read that > these fields should be auto-populated if the > performance data is being delivered correctly. As a > result, I suspect the performance data is not making > it all the way to PerfParse. > > Nagios is logging performance data when I point it to > a file. But when I connect Nagios to PerfParse via a > pipe, I don't know how to check that the performance > data is going into the pipe or arriving in the MySQL > database. How do I "look" at the pipe? Or view the > contents of the 'perfparse' database in MySQL? > > Any suggestions on what may be broken or how to > continue diagnosing the problem? TIA! -John > > Additional Info: > > Nagios 2.04b > PerfParse v0.105.6 > SunOS 5.9 Generic_112233-12 Sun-Fire-880 > > It seems PerfParse can connect to the MySQL database: > /usr/local/nagios/bin/check_perfparse_version > OK Perfparse Database Version Correct: 0.19. | > pp_ver=0.19 true_ver=0.19 > > I start perfparsed first and it creates a pipe in > /usr/local/nagios/var: > prw-r----- 1 root other 0 Sep 13 12:08 > perfdata-service.log Read these permissions again, and think about what they actually mean. Hint: Nagios has dropped privileges by the time it tries to open this node for writing. -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Lead Developer ------------------------------------------------------- SF.Net email is sponsored by: Tame your development challenges with Apache's Geronimo App Server. Download it for free - -and be entered to win a 42" plasma tv or your very own Sony(tm)PSP. Click here to play: http://sourceforge.net/geronimo.php _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From holger at CIS.FU-Berlin.DE Wed Sep 14 13:50:58 2005 From: holger at CIS.FU-Berlin.DE (Holger Weiss) Date: Wed, 14 Sep 2005 13:50:58 +0200 Subject: Compile problem - 2.0b4 on FreeBSD 5.1 In-Reply-To: <4327941C.4070601@gothambus.com> References: <4327941C.4070601@gothambus.com> Message-ID: <20050914115058.GB4721765@CIS.FU-Berlin.DE> * Drew Linsalata [2005-09-13 23:08]: > In file included from ../include/config.h:114, > from broker.c:26: > /usr/include/sys/resource.h:61: field `ru_utime' has incomplete type > /usr/include/sys/resource.h:62: field `ru_stime' has incomplete type Including before in include/config.h (or include/config.h.in if you're rerunning configure) should fix it. Holger -- PGP fingerprint: F1F0 9071 8084 A426 DD59 9839 59D3 F3A1 B8B5 D3DE ------------------------------------------------------- SF.Net email is sponsored by: Tame your development challenges with Apache's Geronimo App Server. Download it for free - -and be entered to win a 42" plasma tv or your very own Sony(tm)PSP. Click here to play: http://sourceforge.net/geronimo.php _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ae at op5.se Wed Sep 14 13:50:52 2005 From: ae at op5.se (Andreas Ericsson) Date: Wed, 14 Sep 2005 13:50:52 +0200 Subject: Compile problem - 2.0b4 on FreeBSD 5.1 In-Reply-To: <4327941C.4070601@gothambus.com> References: <4327941C.4070601@gothambus.com> Message-ID: <43280E9C.1040803@op5.se> Drew Linsalata wrote: > After a pretty simple "configure" run, we're running into compile issues > with 2.0b4 on a FreeBSD 5.1 box. A "make all" results in: > > > sky1# make all > cd ./base && make > gcc -g -O2 -DHAVE_CONFIG_H -DNSCORE -c broker.c > In file included from ../include/config.h:114, > from broker.c:26: > /usr/include/sys/resource.h:61: field `ru_utime' has incomplete type > /usr/include/sys/resource.h:62: field `ru_stime' has incomplete type > /usr/include/sys/resource.h:79: confused by earlier errors, bailing out > *** Error code 1 > The problems are in the systems header files. As such, the problem is in FreeBSD's development environment and not in Nagios. > Stop in /usr/local/src/nagios-2.0b4/base. > *** Error code 1 > > Stop in /usr/local/src/nagios-2.0b4. > > We don't have the GD/PNG/JPEG stuff on that box yet, but I dont see > where that would cause compile errors. Has anyone seen this before? > > -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Lead Developer ------------------------------------------------------- SF.Net email is sponsored by: Tame your development challenges with Apache's Geronimo App Server. Download it for free - -and be entered to win a 42" plasma tv or your very own Sony(tm)PSP. Click here to play: http://sourceforge.net/geronimo.php _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ae at op5.se Wed Sep 14 13:52:55 2005 From: ae at op5.se (Andreas Ericsson) Date: Wed, 14 Sep 2005 13:52:55 +0200 Subject: problem with check_disk_remote (solaris plugin) In-Reply-To: References: Message-ID: <43280F17.30209@op5.se> Elizar M. Palad wrote: > Hi everyone! > Why is it that when i run the plugin manually, ie, commanline, > the scripts executes without a problem.. example > # ../libexec/check_disk_remote -e rsh -H k3tsyn1 -w 90 -c 98 You're running this check as root. > OK: All Filesystems are below threshold (90/98%) | /=5%;;;0;100 > /usr=59%;;;0;100 /var=30%;;;0;100 /opt=58%;;;0;100 > But when i do this in my command.cfg file: > define command{ > command_name check_remote_disk > command_line $USER1$/check_disk_remote -e rsh -H k3tsyn1 -w 90 -c 98 > } This check is run as a different user. Perhaps the user nagios runs as isn't available on the remote host, or it isn't allowed to login through rsh. -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Lead Developer ------------------------------------------------------- SF.Net email is sponsored by: Tame your development challenges with Apache's Geronimo App Server. Download it for free - -and be entered to win a 42" plasma tv or your very own Sony(tm)PSP. Click here to play: http://sourceforge.net/geronimo.php _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From potus98 at yahoo.com Wed Sep 14 13:55:15 2005 From: potus98 at yahoo.com (John Christian) Date: Wed, 14 Sep 2005 04:55:15 -0700 (PDT) Subject: perfparse doesn't display hosts, services In-Reply-To: <43280901.9000806@op5.se> References: <43280901.9000806@op5.se> Message-ID: <20050914115515.88456.qmail@web54705.mail.yahoo.com> Thanks for the suggestion. I stopped nagios and perfparsed, then started perfparse, changed the perms on the perfdata-service.log file, and started nagios. I repated this a few times while trying different perms on the pipe. prwxrwxrwx 1 root other 0 Sep 14 07:34 perfdata-service.log prwxrwxrwx 1 nagios nagios 0 Sep 14 07:34 perfdata-service.log prw-r----- 1 nagios nagios 0 Sep 14 07:40 perfdata-service.log I continue to receive the following error in nagios.log: Warning: File '/usr/local/nagios/var/perfdata-service.log' could not be opened - service performance data will not be written to file! Other ideas on what I'm missing? -John --- Andreas Ericsson wrote: > John Christian wrote: > > Hi Gurus, > > > > I'm trying to integrate PerfParse with Nagios. > When I > > access the PerfData Graphs page, I get the > PerfParse > > logo, but no hosts or services are listed. I read > that > > these fields should be auto-populated if the > > performance data is being delivered correctly. As > a > > result, I suspect the performance data is not > making > > it all the way to PerfParse. > > > > Nagios is logging performance data when I point it > to > > a file. But when I connect Nagios to PerfParse via > a > > pipe, I don't know how to check that the > performance > > data is going into the pipe or arriving in the > MySQL > > database. How do I "look" at the pipe? Or view the > > contents of the 'perfparse' database in MySQL? > > > > Any suggestions on what may be broken or how to > > continue diagnosing the problem? TIA! -John > > > > Additional Info: > > > > Nagios 2.04b > > PerfParse v0.105.6 > > SunOS 5.9 Generic_112233-12 Sun-Fire-880 > > > > It seems PerfParse can connect to the MySQL > database: > > /usr/local/nagios/bin/check_perfparse_version > > OK Perfparse Database Version Correct: 0.19. | > > pp_ver=0.19 true_ver=0.19 > > > > I start perfparsed first and it creates a pipe in > > /usr/local/nagios/var: > > prw-r----- 1 root other 0 Sep 13 > 12:08 > > perfdata-service.log > > > Read these permissions again, and think about what > they actually mean. > Hint: Nagios has dropped privileges by the time it > tries to open this > node for writing. > > -- > Andreas Ericsson > andreas.ericsson at op5.se > OP5 AB www.op5.se > Lead Developer > > > ------------------------------------------------------- > SF.Net email is sponsored by: > Tame your development challenges with Apache's > Geronimo App Server. > Download it for free - -and be entered to win a 42" > plasma tv or your very > own Sony(tm)PSP. Click here to play: > http://sourceforge.net/geronimo.php > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version > (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being > sent to /dev/null > __________________________________ Yahoo! Mail - PC Magazine Editors' Choice 2005 http://mail.yahoo.com ------------------------------------------------------- SF.Net email is sponsored by: Tame your development challenges with Apache's Geronimo App Server. Download it for free - -and be entered to win a 42" plasma tv or your very own Sony(tm)PSP. Click here to play: http://sourceforge.net/geronimo.php _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ae at op5.se Wed Sep 14 14:04:55 2005 From: ae at op5.se (Andreas Ericsson) Date: Wed, 14 Sep 2005 14:04:55 +0200 Subject: perfparse doesn't display hosts, services In-Reply-To: <20050914115515.88456.qmail@web54705.mail.yahoo.com> References: <20050914115515.88456.qmail@web54705.mail.yahoo.com> Message-ID: <432811E7.7090209@op5.se> John Christian wrote: > Thanks for the suggestion. I stopped nagios and > perfparsed, then started perfparse, changed the perms > on the perfdata-service.log file, and started nagios. > I repated this a few times while trying different > perms on the pipe. > > prwxrwxrwx 1 root other 0 Sep 14 07:34 > perfdata-service.log > > prwxrwxrwx 1 nagios nagios 0 Sep 14 07:34 > perfdata-service.log > > prw-r----- 1 nagios nagios 0 Sep 14 07:40 > perfdata-service.log > > I continue to receive the following error in > nagios.log: > > Warning: File > '/usr/local/nagios/var/perfdata-service.log' could not > be opened - service performance data will not be > written to file! > > Other ideas on what I'm missing? Well, since Nagios claims it can't write to the *file*, and the inode is in fact a pipe, it might be causing some sort of error. I'm not very familiar with perfparse stuff, but I seem to remember something about having a cron-job run every once in a while that empties the perfparse data-file and submits it to the pipe (or some such). -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Lead Developer ------------------------------------------------------- SF.Net email is sponsored by: Tame your development challenges with Apache's Geronimo App Server. Download it for free - -and be entered to win a 42" plasma tv or your very own Sony(tm)PSP. Click here to play: http://sourceforge.net/geronimo.php _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Ralph.Grothe at itdz-berlin.de Wed Sep 14 14:08:07 2005 From: Ralph.Grothe at itdz-berlin.de (Ralph.Grothe at itdz-berlin.de) Date: Wed, 14 Sep 2005 14:08:07 +0200 Subject: Multiple notification_interval settings for the same service defi nition Message-ID: <6B893C5F2902D311A23F0090272854FB0575B809@litex001.lit.verwalt-berlin.de> Hello Nagios Experts, I wonder how to handle this. I would like Nagios to send out different notifications about a failed service to several recipients with varying degree of repition. The need for such is pretty obvious, since I need to file a trouble ticket (TT) with our TT management software only *once* (caveat TT flooding), but on the other hand for certain admins or users of the failed service it should be perfectly in order if they received repetitive notifications (sort of to increase the nag level to boost their intervention). I have no idea how to configure this. If I set the attribute notification_interval to 0 for a service, according to the docs there should be issued a notification only once, which was fine for the TT generation but missed the reminder to admins and users. As far as I can see one can also only define one event_handler per service. Because I think this is a common issue I'm confident that some of you already know a work-around. Regards Ralph ------------------------------------------------------- SF.Net email is sponsored by: Tame your development challenges with Apache's Geronimo App Server. Download it for free - -and be entered to win a 42" plasma tv or your very own Sony(tm)PSP. Click here to play: http://sourceforge.net/geronimo.php _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From holger at CIS.FU-Berlin.DE Wed Sep 14 14:21:52 2005 From: holger at CIS.FU-Berlin.DE (Holger Weiss) Date: Wed, 14 Sep 2005 14:21:52 +0200 Subject: Multiple notification_interval settings for the same service defi nition In-Reply-To: <6B893C5F2902D311A23F0090272854FB0575B809@litex001.lit.verwalt-berlin.de> References: <6B893C5F2902D311A23F0090272854FB0575B809@litex001.lit.verwalt-berlin.de> Message-ID: <20050914122152.GC4721765@CIS.FU-Berlin.DE> * Ralph.Grothe at itdz-berlin.de [2005-09-14 14:08]: > I would like Nagios to send out different notifications about a > failed service to several recipients with varying degree of repition. Service escalations should do the job: http://nagios.sourceforge.net/docs/2_0/escalations.html Holger -- PGP fingerprint: F1F0 9071 8084 A426 DD59 9839 59D3 F3A1 B8B5 D3DE ------------------------------------------------------- SF.Net email is sponsored by: Tame your development challenges with Apache's Geronimo App Server. Download it for free - -and be entered to win a 42" plasma tv or your very own Sony(tm)PSP. Click here to play: http://sourceforge.net/geronimo.php _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From holger at CIS.FU-Berlin.DE Wed Sep 14 14:57:01 2005 From: holger at CIS.FU-Berlin.DE (Holger Weiss) Date: Wed, 14 Sep 2005 14:57:01 +0200 Subject: PDF documentation for 2.0 Message-ID: <20050914125700.GD4721765@CIS.FU-Berlin.DE> In order to print them, I converted the HTML docs for Nagios 2.0 to PDF. Just in case anyone else is interested: ftp://ftp.in-berlin.de/pub/users/weiss/doc/nagios/nagios-2.pdf Holger -- PGP fingerprint: F1F0 9071 8084 A426 DD59 9839 59D3 F3A1 B8B5 D3DE ------------------------------------------------------- SF.Net email is sponsored by: Tame your development challenges with Apache's Geronimo App Server. Download it for free - -and be entered to win a 42" plasma tv or your very own Sony(tm)PSP. Click here to play: http://sourceforge.net/geronimo.php _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ton.voon at altinity.com Wed Sep 14 14:54:41 2005 From: ton.voon at altinity.com (Ton Voon) Date: Wed, 14 Sep 2005 13:54:41 +0100 Subject: perfparse doesn't display hosts, services In-Reply-To: <20050914115515.88456.qmail@web54705.mail.yahoo.com> References: <20050914115515.88456.qmail@web54705.mail.yahoo.com> Message-ID: <63E38EB4-B92A-4E0C-A05E-A571CC8F6DD3@altinity.com> John, I haven't used perfparse, but I had a problem when I was trying to use a daemon that was reading data from the named pipe but nagios could not write to the named pipe. Problem was that nagios tries to open the perf file in append mode (which makes sense if it is actually a file). This fails for a pipe (at least on Solaris 2.6). So I patched nagios to open it in write mode - please find attached. This is against Nagios 1.0, so it may have moved for later versions. Ton -------------- next part -------------- A non-text attachment was scrubbed... Name: xpdfile.c.patch Type: application/octet-stream Size: 928 bytes Desc: not available URL: -------------- next part -------------- On 14 Sep 2005, at 12:55, John Christian wrote: > Thanks for the suggestion. I stopped nagios and > perfparsed, then started perfparse, changed the perms > on the perfdata-service.log file, and started nagios. > I repated this a few times while trying different > perms on the pipe. > > prwxrwxrwx 1 root other 0 Sep 14 07:34 > perfdata-service.log > > prwxrwxrwx 1 nagios nagios 0 Sep 14 07:34 > perfdata-service.log > > prw-r----- 1 nagios nagios 0 Sep 14 07:40 > perfdata-service.log > > I continue to receive the following error in > nagios.log: > > Warning: File > '/usr/local/nagios/var/perfdata-service.log' could not > be opened - service performance data will not be > written to file! > > Other ideas on what I'm missing? > -John http://www.altinity.com T: +44 (0)870 787 9243 F: +44 (0)845 280 1725 Skype: tonvoon The contents of this email and any files transmitted with it are confidential and intended solely for the use of the individuals to whom it is addressed. If you are not the intended recipient or have received this e-mail in error please notify the sender and destroy this e-mail immediately. Any unauthorised copying, disclosure or distribution of the material in this e-mail is strictly prohibited. From elizar.palad at gmail.com Wed Sep 14 10:55:20 2005 From: elizar.palad at gmail.com (Elizar M. Palad) Date: Wed, 14 Sep 2005 16:55:20 +0800 Subject: problem with check_disk_remote (solaris plugin) In-Reply-To: References: <79BF0B53C4A28446A097D6170CF4C69955ED3C@xw2k3-hammbx-03.ads.dlh.de> Message-ID: oh btw, i have very little programming background.. :) help? sorry if i reply to my post.. On 9/14/05, Elizar M. Palad wrote: > > its not working! checked the ownhership/permission of the file, both ok > nagios 755 > but when executed by nagios user, i got: > $ ../libexec/check_disk_remote -e rsh -H k3tsyn1 -w 20 -c 30 > permission denied > permission denied > Use of uninitialized value in concatenation (.) or string at > ../libexec/check_disk_remote line 119. > | > checking the file now.. but any inputs are welcome.. > thanks! > > > On 9/14/05, oliver.steenbuck at lhsystems.com > wrote: > > > > is the plugin also working if you are nagios and start it from the > > commandline ? > > > > -----Urspr?ngliche Nachricht----- > > *Von:* Elizar M. Palad [mailto: elizar.palad at gmail.com] > > *Gesendet am:* Mittwoch, 14. September 2005 10:41 > > *An:* STEENBUCK, OLIVER > > *Cc:* nagios-users at lists.sourceforge.net > > *Betreff:* Re: [Nagios-users] problem with check_disk_remote (solaris > > plugin) > > yes actually, no. I am root in the commandline.. > > ? > > > > > > On 9/14/05, oliver.steenbuck at lhsystems.com > > wrote: > > > > > > maybe your try from the commandline was not as the nagios user ? > > > > > > -----Urspr?ngliche Nachricht----- > > > *Von:* nagios-users-admin at lists.sourceforge.net [mailto:nagios-users-admin at lists.sourceforge.net > > > ]*Im Auftrag von* Elizar M. Palad > > > *Gesendet am:* Mittwoch, 14. September 2005 10:36 > > > *An:* nagios-users at lists.sourceforge.net > > > *Betreff:* [Nagios-users] problem with check_disk_remote (solaris > > > plugin) > > > Hi everyone! > > > Why is it that when i run the plugin manually, ie, commanline, > > > the scripts executes without a problem.. example > > > # ../libexec/check_disk_remote -e rsh -H k3tsyn1 -w 90 -c 98 > > > OK: All Filesystems are below threshold (90/98%) | /=5%;;;0;100 > > > /usr=59%;;;0;100 /var=30%;;;0;100 /opt=58%;;;0;100 > > > But when i do this in my command.cfg file: > > > define command{ > > > command_name check_remote_disk > > > command_line $USER1$/check_disk_remote -e rsh -H k3tsyn1 -w 90 -c 98 > > > } > > > (yes, i know that's the macro thing.. :) its not working also, i put > > > it like > > > that to simulate manual execution) > > > the logs says: > > > [1126686654] SERVICE ALERT: LTX k3tsyn1 ;Disk Usage;UNKNOWN;SOFT;1;(No > > > output returned from plugin) > > > [1126686714] SERVICE ALERT: LTX k3tsyn1 ;Disk Usage;UNKNOWN;SOFT;2;(No > > > output returned from plugin) > > > Thanks! > > > > > > -- > > > ---- > > > Don't Tell Me How Hard You Work.. > > > Show Me How Much You'd Accomplished.. > > > > > > > > > > > > -- > > ---- > > Don't Tell Me How Hard You Work.. > > Show Me How Much You'd Accomplished.. > > > > > > > -- > ---- > Don't Tell Me How Hard You Work.. > Show Me How Much You'd Accomplished.. > -- ---- Don't Tell Me How Hard You Work.. Show Me How Much You'd Accomplished.. -------------- next part -------------- An HTML attachment was scrubbed... URL: From drew at gothambus.com Wed Sep 14 15:26:08 2005 From: drew at gothambus.com (Drew Linsalata) Date: Wed, 14 Sep 2005 09:26:08 -0400 Subject: Compile problem - 2.0b4 on FreeBSD 5.1 In-Reply-To: <20050914115058.GB4721765@CIS.FU-Berlin.DE> References: <4327941C.4070601@gothambus.com> <20050914115058.GB4721765@CIS.FU-Berlin.DE> Message-ID: <432824F0.2060800@gothambus.com> Holger Weiss wrote: > * Drew Linsalata [2005-09-13 23:08]: > >>In file included from ../include/config.h:114, >> from broker.c:26: >>/usr/include/sys/resource.h:61: field `ru_utime' has incomplete type >>/usr/include/sys/resource.h:62: field `ru_stime' has incomplete type > > > Including before in include/config.h (or > include/config.h.in if you're rerunning configure) should fix it. That did solve the complile problem. Thanks! -- Drew Linsalata The Gotham Bus Company, Inc. Dedicated Servers and Colocation Solutions Long Island, New York http://www.gothambus.com ------------------------------------------------------- SF.Net email is sponsored by: Tame your development challenges with Apache's Geronimo App Server. Download it for free - -and be entered to win a 42" plasma tv or your very own Sony(tm)PSP. Click here to play: http://sourceforge.net/geronimo.php _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Ralph.Grothe at itdz-berlin.de Wed Sep 14 15:32:01 2005 From: Ralph.Grothe at itdz-berlin.de (Ralph.Grothe at itdz-berlin.de) Date: Wed, 14 Sep 2005 15:32:01 +0200 Subject: Multiple notification_interval settings for th e same service defi nition Message-ID: <6B893C5F2902D311A23F0090272854FB0575B80A@litex001.lit.verwalt-berlin.de> Holger, thanks for pointing me to the chapter of the docs that treats service escalations. I haven't given it attention yet because I considered it one of the more advanced features. I will peruse and educate myself more, and might come back if I still need clarification. N.b. many thanks for providing a comprehensive PDF volume of the scattered HTML documentation. This will make a far better hardcopy manual which I prefer over any documentation that would force me into reading more than 20 pages on a screen. Another problem with the offline printout from the HTML for me was that the ruddy (ambiguity intended ;-) type of the required attributes from definitions didn't at all distinguish optically from the remaining optional attributes in my b/w print. Cheers Ralph > -----Original Message----- > From: nagios-users-admin at lists.sourceforge.net > [mailto:nagios-users-admin at lists.sourceforge.net]On Behalf Of Holger > Weiss > Sent: Wednesday, September 14, 2005 2:22 PM > To: nagios-users at lists.sourceforge.net > Subject: Re: [Nagios-users] Multiple notification_interval > settings for > the same service defi nition > > > * Ralph.Grothe at itdz-berlin.de [2005-09-14 14:08]: > > I would like Nagios to send out different notifications about a > > failed service to several recipients with varying degree of > repition. > > Service escalations should do the job: > > http://nagios.sourceforge.net/docs/2_0/escalations.html > > Holger > > -- > PGP fingerprint: F1F0 9071 8084 A426 DD59 9839 59D3 F3A1 B8B5 D3DE > > > ------------------------------------------------------- > SF.Net email is sponsored by: > Tame your development challenges with Apache's Geronimo App Server. > Download it for free - -and be entered to win a 42" plasma tv > or your very > own Sony(tm)PSP. Click here to play: > http://sourceforge.net/geronimo.php > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS > when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > ------------------------------------------------------- SF.Net email is sponsored by: Tame your development challenges with Apache's Geronimo App Server. Download it for free - -and be entered to win a 42" plasma tv or your very own Sony(tm)PSP. Click here to play: http://sourceforge.net/geronimo.php _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Wed Sep 14 15:35:27 2005 From: marc at ena.com (Marc Powell) Date: Wed, 14 Sep 2005 08:35:27 -0500 Subject: Monitoring temperature Message-ID: > -----Original Message----- > From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users- > admin at lists.sourceforge.net] On Behalf Of Richard Gliebe > Sent: Wednesday, September 14, 2005 3:53 AM > To: Nagios List > Subject: [Nagios-users] Monitoring temperature > > Hi, > > I'm looking for a Plugin (check_*) to monitor the temperatures from our > Cisco switches and routers via snmp. This was a topic of discussion on this list last week or the week before that included several solutions. -- marc ------------------------------------------------------- SF.Net email is sponsored by: Tame your development challenges with Apache's Geronimo App Server. Download it for free - -and be entered to win a 42" plasma tv or your very own Sony(tm)PSP. Click here to play: http://sourceforge.net/geronimo.php _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Wed Sep 14 15:42:04 2005 From: marc at ena.com (Marc Powell) Date: Wed, 14 Sep 2005 08:42:04 -0500 Subject: notification issues Message-ID: > -----Original Message----- > From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users- > admin at lists.sourceforge.net] On Behalf Of Chris Knipe > Sent: Wednesday, September 14, 2005 4:26 AM > To: nagios-users at lists.sourceforge.net > Subject: [Nagios-users] notification issues > > Hi, > > I monitor arround 500 services via NRPE2. Up to now, nevermind what I do, > Nagios sends a notification everytime NRPE's OUTPUT changes, and not when > then STATE changes.... That's quite strange considering nagios doesn't parse the plugin output in any way for status information. Status is entirely derived from the plugin exit code. Notifications are entirely based on the status. > > Can anyone please give me some pointers as to what could be causing > this???? Without specific configuration examples and test runs, not really. OS information, nagios version, how it was installed, etc may be useful as well. Try running the NRPE commands exactly as they are defined in your command definition, substituting appropriate macros of course. Do this as the nagios user and use 'echo $?' after testing to verify the exit code. -- marc ------------------------------------------------------- SF.Net email is sponsored by: Tame your development challenges with Apache's Geronimo App Server. Download it for free - -and be entered to win a 42" plasma tv or your very own Sony(tm)PSP. Click here to play: http://sourceforge.net/geronimo.php _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From magle at cacdhh.org Wed Sep 14 15:40:36 2005 From: magle at cacdhh.org (Matthew Agle) Date: Wed, 14 Sep 2005 09:40:36 -0400 Subject: suggestions for upgrading Nagios In-Reply-To: <432525E8.7080001@op5.se> References: <432525E8.7080001@op5.se> Message-ID: <002501c5b931$e5e45650$8400a8c0@sshi.local> Thanks to you all for the suggestions! I will try to get this rolling and see what happens (but just in case I am building it on my machine first and then I'll go from there...). Thanks again! Sincerely, ? Matthew Agle - MCP, MCSA Network Engineer CACDHH Information Technology Phone: (810)239-3112 ext. 233 Email:? magle at cacdhh.org Disclaimer: The information transmitted is intended only for the person or entity to whom or which it is addressed and may contain confidential and/or privileged material. Any review, retransmission, dissemination or other use of this information by persons or entities other than the intended recipient is prohibited. If you receive this in error, please delete this material immediately. -----Original Message----- From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users-admin at lists.sourceforge.net] On Behalf Of Andreas Ericsson Sent: Monday, September 12, 2005 2:53 AM To: nagios-users at lists.sourceforge.net Subject: Re: [Nagios-users] suggestions for upgrading Nagios Fred wrote: > I believe there are a number of changed configuration variables that > will be obvious when you try to start a 1.2 config under 2.0b*, I know > there were a number of syntax errors when I upgraded. > Don't forget the macros in the notification commands, or we'll be answering that particular question for the 16033rd time tomorrow. > -FredC > > --- Greg Vickers wrote: > > >>Matt, >> >>Matthew Agle wrote: >> >>>Hello, >>> >>> I have currently running Nagios version 1.2 and looking to >>>upgrade to 2.04b. Has anyone done this and/or performed a upgrade and >>>if so what suggestions/tips would you have? Is it easier to upgrade or >>>simply install to a different location and point the config files >>>there? Thanks in advance for any feedback! >> >>RTFM is your friend. I would very carefully read what is new in version >>2.0, in the "What's new in this version" section of the online manual. >> >>http://www.nagios.org, Support, Online Documentation, v2.x HTML, Table >>of Contents, What's new in this version.... >> >>If I were you, I would pay special attention to the Hostgroup changes. >> >>Take a copy of your 1.x config and run the 2.x binary against it, see >>what is broken, that will give you a good starting point. >> >>HTH, >> >>[p.s.] OK, ok, the only change you *have* to make is moving the new >>location of the contact_groups directive... but there's so much other >>new and better stuff in v2.x, have a look and see what is applicable to >>your situation. >> >>-- >>Greg Vickers >>Project Manager, IT Security >>Information Technology Services >>Queensland University of Technology >>L12, 126 Margaret St, Brisbane >> >>Phone: (07) 3864 9536 >>Email: g.vickers at qut.edu.au >>IT Security web site: http://www.its.qut.edu.au/itsecurity/ >> >>CRICOS No. 00213J >> >> >>------------------------------------------------------- >>SF.Net email is Sponsored by the Better Software Conference & EXPO >>September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices >>Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA >>Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf >>_______________________________________________ >>Nagios-users mailing list >>Nagios-users at lists.sourceforge.net >>https://lists.sourceforge.net/lists/listinfo/nagios-users >>::: Please include Nagios version, plugin version (-v) and OS when reporting >>any issue. >>::: Messages without supporting info will risk being sent to /dev/null >> > > > > > > > > > ------------------------------------------------------- > SF.Net email is Sponsored by the Better Software Conference & EXPO > September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices > Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA > Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Lead Developer ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------- SF.Net email is sponsored by: Tame your development challenges with Apache's Geronimo App Server. Download it for free - -and be entered to win a 42" plasma tv or your very own Sony(tm)PSP. Click here to play: http://sourceforge.net/geronimo.php _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Wed Sep 14 15:53:14 2005 From: marc at ena.com (Marc Powell) Date: Wed, 14 Sep 2005 08:53:14 -0500 Subject: notification issues Message-ID: Mmmm. Talking to myself now I see... > -----Original Message----- > From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users- > admin at lists.sourceforge.net] On Behalf Of Marc Powell > Sent: Wednesday, September 14, 2005 8:42 AM > To: Nagios List > Subject: RE: [Nagios-users] notification issues > > > > > -----Original Message----- > > From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users- > > admin at lists.sourceforge.net] On Behalf Of Chris Knipe > > Sent: Wednesday, September 14, 2005 4:26 AM > > To: nagios-users at lists.sourceforge.net > > Subject: [Nagios-users] notification issues > > > > Hi, > > > > I monitor arround 500 services via NRPE2. Up to now, nevermind what I > do, > > Nagios sends a notification everytime NRPE's OUTPUT changes, and not > when > > then STATE changes.... > > That's quite strange considering nagios doesn't parse the plugin output > in any way for status information. Status is entirely derived from the > plugin exit code. Notifications are entirely based on the status. As a followup, there is one possibility. IFF the service status is not OK for each check AND you have is_volatile set for the service, then you would get an alert for each check. While it doesn't have anything to do with the plugin output, it may appear that way. http://nagios.sourceforge.net/docs/1_0/volatileservices.html marc ------------------------------------------------------- SF.Net email is sponsored by: Tame your development challenges with Apache's Geronimo App Server. Download it for free - -and be entered to win a 42" plasma tv or your very own Sony(tm)PSP. Click here to play: http://sourceforge.net/geronimo.php _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From savage at savage.za.org Wed Sep 14 16:00:52 2005 From: savage at savage.za.org (Chris Knipe) Date: Wed, 14 Sep 2005 16:00:52 +0200 Subject: notification issues References: Message-ID: <03e101c5b934$b99d0300$0a02a8c0@MEGADROID> Hi Marc, Thanks very much. I did check error / return codes, they return 0 - still triggering alerts. I have disabled volatile in the services configuration, will see if that solves it. Thanks again, Chris ----- Original Message ----- From: "Marc Powell" To: "Nagios List" Sent: Wednesday, September 14, 2005 3:53 PM Subject: RE: [Nagios-users] notification issues Mmmm. Talking to myself now I see... > -----Original Message----- > From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users- > admin at lists.sourceforge.net] On Behalf Of Marc Powell > Sent: Wednesday, September 14, 2005 8:42 AM > To: Nagios List > Subject: RE: [Nagios-users] notification issues > > > > > -----Original Message----- > > From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users- > > admin at lists.sourceforge.net] On Behalf Of Chris Knipe > > Sent: Wednesday, September 14, 2005 4:26 AM > > To: nagios-users at lists.sourceforge.net > > Subject: [Nagios-users] notification issues > > > > Hi, > > > > I monitor arround 500 services via NRPE2. Up to now, nevermind what I > do, > > Nagios sends a notification everytime NRPE's OUTPUT changes, and not > when > > then STATE changes.... > > That's quite strange considering nagios doesn't parse the plugin output > in any way for status information. Status is entirely derived from the > plugin exit code. Notifications are entirely based on the status. As a followup, there is one possibility. IFF the service status is not OK for each check AND you have is_volatile set for the service, then you would get an alert for each check. While it doesn't have anything to do with the plugin output, it may appear that way. http://nagios.sourceforge.net/docs/1_0/volatileservices.html marc ------------------------------------------------------- SF.Net email is sponsored by: Tame your development challenges with Apache's Geronimo App Server. Download it for free - -and be entered to win a 42" plasma tv or your very own Sony(tm)PSP. Click here to play: http://sourceforge.net/geronimo.php _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------- SF.Net email is sponsored by: Tame your development challenges with Apache's Geronimo App Server. Download it for free - -and be entered to win a 42" plasma tv or your very own Sony(tm)PSP. Click here to play: http://sourceforge.net/geronimo.php _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From enediel at hotmail.com Wed Sep 14 17:10:24 2005 From: enediel at hotmail.com (enediel gonzalez) Date: Wed, 14 Sep 2005 15:10:24 +0000 Subject: control panel for nagios Message-ID: hello everyone I have nagios running but, I'm looking for a control panel to it. ?Have anybody installed ncpl? I downloaded it from the sourforge repository ?Any other suggestion? Thanks in advance for any help Regards Enediel ------------------------------------------------------- SF.Net email is sponsored by: Tame your development challenges with Apache's Geronimo App Server. Download it for free - -and be entered to win a 42" plasma tv or your very own Sony(tm)PSP. Click here to play: http://sourceforge.net/geronimo.php _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Ralph.Grothe at itdz-berlin.de Wed Sep 14 17:16:56 2005 From: Ralph.Grothe at itdz-berlin.de (Ralph.Grothe at itdz-berlin.de) Date: Wed, 14 Sep 2005 17:16:56 +0200 Subject: Multiple notification_interval settings for th e same service defi nition Message-ID: <6B893C5F2902D311A23F0090272854FB0575B80C@litex001.lit.verwalt-berlin.de> Sorry, for haunting you back. But service escalations don't quite seem to be an elegant remedy. This may be due to a misapprehension of the subject I still suffer from after having read the parts in the doc that cover service escalations. Because I would need to run two different event_handlers (viz. one to file a trouble ticket at a high escalation level only once, and one to send out repeated nagging email notifications to admins or some other poor buggers), this also meant I would have to (re)introduce another couple of hundred service definitions that only would differ in their service_description to be referenced accordingly by their respective serviceescalation definition. This looks prohibitive a prospect to me. Unless there was a possibility to implement a case distinction (e.g. maybe through some macro variable like $SERVICEATTEMPT$), sort of polymorphism, by which the called event_handler would decide which incarnation to execute at run time. (I think the OO folks call it late binding?) Can one fumble up something like that, or am I completely off the track? > -----Original Message----- > From: nagios-users-admin at lists.sourceforge.net > [mailto:nagios-users-admin at lists.sourceforge.net]On Behalf Of > Ralph.Grothe at itdz-berlin.de > Sent: Wednesday, September 14, 2005 3:32 PM > To: holger at CIS.FU-Berlin.DE; nagios-users at lists.sourceforge.net > Subject: RE: [Nagios-users] Multiple notification_interval > settings for > th e same service defi nition > > > Holger, > > thanks for pointing me to the chapter of the docs that treats > service escalations. > > I haven't given it attention yet because I considered it one of > the more advanced features. > > I will peruse and educate myself more, and might come back if I > still need clarification. > > N.b. many thanks for providing a comprehensive PDF volume of the > scattered HTML documentation. > This will make a far better hardcopy manual which I prefer over > any documentation that would force me > into reading more than 20 pages on a screen. > Another problem with the offline printout from the HTML for me > was that the ruddy (ambiguity intended ;-) > type of the required attributes from definitions didn't at all > distinguish optically from the remaining > optional attributes in my b/w print. > > Cheers > Ralph > > > > -----Original Message----- > > From: nagios-users-admin at lists.sourceforge.net > > [mailto:nagios-users-admin at lists.sourceforge.net]On Behalf Of > Holger > > Weiss > > Sent: Wednesday, September 14, 2005 2:22 PM > > To: nagios-users at lists.sourceforge.net > > Subject: Re: [Nagios-users] Multiple notification_interval > > settings for > > the same service defi nition > > > > > > * Ralph.Grothe at itdz-berlin.de [2005-09-14 14:08]: > > > I would like Nagios to send out different notifications about > a > > > failed service to several recipients with varying degree of > > repition. > > > > Service escalations should do the job: > > > > http://nagios.sourceforge.net/docs/2_0/escalations.html > > > > Holger > > > > -- > > PGP fingerprint: F1F0 9071 8084 A426 DD59 9839 59D3 F3A1 B8B5 > D3DE > > > > > > ------------------------------------------------------- > > SF.Net email is sponsored by: > > Tame your development challenges with Apache's Geronimo App > Server. > > Download it for free - -and be entered to win a 42" plasma tv > > or your very > > own Sony(tm)PSP. Click here to play: > > http://sourceforge.net/geronimo.php > > _______________________________________________ > > Nagios-users mailing list > > Nagios-users at lists.sourceforge.net > > https://lists.sourceforge.net/lists/listinfo/nagios-users > > ::: Please include Nagios version, plugin version (-v) and OS > > when reporting any issue. > > ::: Messages without supporting info will risk being sent to > /dev/null > > > > > ------------------------------------------------------- > SF.Net email is sponsored by: > Tame your development challenges with Apache's Geronimo App Server. > Download it for free - -and be entered to win a 42" plasma tv > or your very > own Sony(tm)PSP. Click here to play: http://sourceforge.net/geronimo.php _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------- SF.Net email is sponsored by: Tame your development challenges with Apache's Geronimo App Server. Download it for free - -and be entered to win a 42" plasma tv or your very own Sony(tm)PSP. Click here to play: http://sourceforge.net/geronimo.php _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Wed Sep 14 17:23:43 2005 From: marc at ena.com (Marc Powell) Date: Wed, 14 Sep 2005 10:23:43 -0500 Subject: PDF documentation for 2.0 Message-ID: > -----Original Message----- > From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users- > admin at lists.sourceforge.net] On Behalf Of Holger Weiss > Sent: Wednesday, September 14, 2005 7:57 AM > To: Nagios Users > Subject: [Nagios-users] PDF documentation for 2.0 > > In order to print them, I converted the HTML docs for Nagios 2.0 to PDF. > Just in case anyone else is interested: > > ftp://ftp.in-berlin.de/pub/users/weiss/doc/nagios/nagios-2.pdf Thanks for this! I'm not sure Ethan watches this list much these days so you may want to bounce this over to nagios-devel and see if he'll post it on www.nagios.org. -- Marc ------------------------------------------------------- SF.Net email is sponsored by: Tame your development challenges with Apache's Geronimo App Server. Download it for free - -and be entered to win a 42" plasma tv or your very own Sony(tm)PSP. Click here to play: http://sourceforge.net/geronimo.php _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ravikmrs at yahoo.com Wed Sep 14 17:23:27 2005 From: ravikmrs at yahoo.com (Ravi Kumar) Date: Wed, 14 Sep 2005 08:23:27 -0700 (PDT) Subject: Load graph Message-ID: <20050914152327.82853.qmail@web53906.mail.yahoo.com> I want to plot graph on load monitoring. I'm using rrd to create graph but the graph didn't show details. rrdtool create ta2-prod-load.rrd -s 60 DS:load:GAUGE:0:300:U RRA:AVERAGE:0.5:1:50400 RRA:AVERA E:0.5:60:43800 please help what is wrong in above command, thanks --------------------------------- Yahoo! for Good Click here to donate to the Hurricane Katrina relief effort. -------------- next part -------------- An HTML attachment was scrubbed... URL: From james.mohr at elaxy.com Wed Sep 14 17:59:11 2005 From: james.mohr at elaxy.com (Mohr James) Date: Wed, 14 Sep 2005 17:59:11 +0200 Subject: NRPE Character limitation Message-ID: Hi All! We have run into the infamous 350 character nrpe boundary. I have done a a lot of research on what the problem is, but what I am missing is *where* the problem is. What I have been able to find out is that the problem lies in the limit of an atomic write to a pipe and therefore the nrpe code has been configure to limit the size of messages. The question is where in the chain is this pipe? I found the my_system function in nrpe.c where it uses a pipe. However, shouldn't this pipe be unique to the process? If so, then this shouldn't be the place where there is a problem, or is it? Looking through the code just within the nagios base directory, there are a number of places where various pipes are used, so I am having trouble finding the exact spot. The bottom line is we want/need to return larger amounts of data. Thus, I need to find the place(s) where I should change the code, as well as the place(s) where there are problems. I cannot find within the nrpe code, where this limit is defined. We have called check_nrpe directly, avoiding Nagios completely and we still have this limitation, so it seems that the limit is either in check_nrpe or nrpe itself. Thus, I cannot find where I need to increase this value. If the problem really is within Nagios, is it possible to use the Nagios Event Broker to catch the messages, write the complete text to a file and then cut the message down to the 350 character limit? Any help is greatly appreaciated. Regards, Jim Mohr ------------------------------------------------------- SF.Net email is sponsored by: Tame your development challenges with Apache's Geronimo App Server. Download it for free - -and be entered to win a 42" plasma tv or your very own Sony(tm)PSP. Click here to play: http://sourceforge.net/geronimo.php _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From sudheer at tgs-solutions.com Wed Sep 14 17:58:39 2005 From: sudheer at tgs-solutions.com (Sudheer Muddappa) Date: Wed, 14 Sep 2005 11:58:39 -0400 Subject: Oracle DB Plugin Message-ID: <432848AF.3020804@tgs-solutions.com> Hi all, Is there a plugin to monitor the oracle DB? Please let me know. Thanks, -- Sudheer Muddappa ------------------------------------------------------- SF.Net email is sponsored by: Tame your development challenges with Apache's Geronimo App Server. Download it for free - -and be entered to win a 42" plasma tv or your very own Sony(tm)PSP. Click here to play: http://sourceforge.net/geronimo.php _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From holger at CIS.FU-Berlin.DE Wed Sep 14 18:10:53 2005 From: holger at CIS.FU-Berlin.DE (Holger Weiss) Date: Wed, 14 Sep 2005 18:10:53 +0200 Subject: PDF documentation for 2.0 (was: Multiple notification_interval settings for the same service definition) In-Reply-To: <6B893C5F2902D311A23F0090272854FB0575B80A@litex001.lit.verwalt-berlin.de> References: <6B893C5F2902D311A23F0090272854FB0575B80A@litex001.lit.verwalt-berlin.de> Message-ID: <20050914161052.GA4730212@CIS.FU-Berlin.DE> * Ralph.Grothe at itdz-berlin.de [2005-09-14 15:32]: > N.b. many thanks for providing a comprehensive PDF volume of the > scattered HTML documentation. This will make a far better hardcopy > manual which I prefer over any documentation that would force me into > reading more than 20 pages on a screen. Another problem with the > offline printout from the HTML for me was that the ruddy (ambiguity > intended ;-) type of the required attributes from definitions didn't > at all distinguish optically from the remaining optional attributes in > my b/w print. The PDF was grayscaled too, though. I've just uploaded a colored version and moved the old file to ftp://ftp.in-berlin.de/pub/users/weiss/doc/nagios/nagios-2-grayscale.pdf However, the required directives are colored red using CSS, and this doesn't seem to be recognized by html2ps (which I used), sorry. Holger -- PGP fingerprint: F1F0 9071 8084 A426 DD59 9839 59D3 F3A1 B8B5 D3DE ------------------------------------------------------- SF.Net email is sponsored by: Tame your development challenges with Apache's Geronimo App Server. Download it for free - -and be entered to win a 42" plasma tv or your very own Sony(tm)PSP. Click here to play: http://sourceforge.net/geronimo.php _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From robmossrm at aol.com Wed Sep 14 18:18:44 2005 From: robmossrm at aol.com (Rob Moss) Date: Wed, 14 Sep 2005 17:18:44 +0100 Subject: Oracle DB Plugin In-Reply-To: <432848AF.3020804@tgs-solutions.com> References: <432848AF.3020804@tgs-solutions.com> Message-ID: <43284D64.5020502@aol.com> Sudheer Muddappa wrote: > Hi all, > Is there a plugin to monitor the oracle DB? > Please let me know. > > Thanks, > Check in the contributed nagios plugins.. I'm pretty sure I saw one in there (also Sybase and a load of other DB's) http://sourceforge.net/projects/nagiosplug/ rob. ------------------------------------------------------- SF.Net email is sponsored by: Tame your development challenges with Apache's Geronimo App Server. Download it for free - -and be entered to win a 42" plasma tv or your very own Sony(tm)PSP. Click here to play: http://sourceforge.net/geronimo.php _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From enst at rao.elektra.ru Wed Sep 14 18:35:33 2005 From: enst at rao.elektra.ru (Evgeny Stepanov) Date: Wed, 14 Sep 2005 20:35:33 +0400 Subject: Oracle DB Plugin In-Reply-To: <432848AF.3020804@tgs-solutions.com> References: <432848AF.3020804@tgs-solutions.com> Message-ID: <1384540618.20050914203533@rao.elektra.ru> Hello Sudheer, SM> Hi all, SM> Is there a plugin to monitor the oracle DB? SM> Please let me know. You can check nagiosexchange site. There is oracle write access plugin here http://www.nagiosexchange.org/Databases.57.0.html?&tx_netnagext_pi1[p_view]=3 i think it's what you need. Since it's just a bash script utilizing sqlplus utility, you can do whatever you want with it. I did not check it by myself, but looking towards. Will look at you :-) Best regards, Evgeny ------------------------------------------------------- SF.Net email is sponsored by: Tame your development challenges with Apache's Geronimo App Server. Download it for free - -and be entered to win a 42" plasma tv or your very own Sony(tm)PSP. Click here to play: http://sourceforge.net/geronimo.php _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From enst at rao.elektra.ru Wed Sep 14 18:52:22 2005 From: enst at rao.elektra.ru (Evgeny Stepanov) Date: Wed, 14 Sep 2005 20:52:22 +0400 Subject: problem with check_disk_remote (solaris plugin) In-Reply-To: References: Message-ID: <1897575664.20050914205222@rao.elektra.ru> Hello Elizar, EMP> Why is it that when i run the plugin manually, ie, commanline, EMP> the scripts executes without a problem.. example EMP> ? EMP> # ../libexec/check_disk_remote -e rsh? -H k3tsyn1 -w 90 -c 98 EMP> OK: All Filesystems are below threshold (90/98%) | EMP> /=5%;;;0;100 /usr=59%;;;0;100 /var=30%;;;0;100 /opt=58%;;;0;100 EMP> ? EMP> But when i do this in my command.cfg file: EMP> ? EMP> define command{ EMP> ??????? command_name??? check_remote_disk EMP> ??????? command_line??? $USER1$/check_disk_remote -e rsh? -H k3tsyn1 -w 90 -c 98 EMP> ??????? } EMP> ? EMP> (yes, i know that's the macro thing.. :) its not working also, i put it like EMP> that to simulate manual execution) Is it possible your rsh authentication fails? When you run it from the shell it takes your credits, and when launched from nagios, it takes nagios credits. Just a possibility. Best regards Evgeny ------------------------------------------------------- SF.Net email is sponsored by: Tame your development challenges with Apache's Geronimo App Server. Download it for free - -and be entered to win a 42" plasma tv or your very own Sony(tm)PSP. Click here to play: http://sourceforge.net/geronimo.php _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From holger at CIS.FU-Berlin.DE Wed Sep 14 19:15:07 2005 From: holger at CIS.FU-Berlin.DE (Holger Weiss) Date: Wed, 14 Sep 2005 19:15:07 +0200 Subject: Multiple notification_interval settings for th e same service defi nition In-Reply-To: <6B893C5F2902D311A23F0090272854FB0575B80C@litex001.lit.verwalt-berlin.de> References: <6B893C5F2902D311A23F0090272854FB0575B80C@litex001.lit.verwalt-berlin.de> Message-ID: <20050914171507.GE4730212@CIS.FU-Berlin.DE> * Ralph.Grothe at itdz-berlin.de [2005-09-14 17:16]: > Because I would need to run two different event_handlers (viz. one to > file a trouble ticket at a high escalation level only once, and one to > send out repeated nagging email notifications to admins or some other > poor buggers), I might have misunderstood what you're trying to achieve, but if you're using an event_handler for filing the trouble ticket anyway, you can take care of only filing it once (for example, only when the service initially goes into a hard error state) within the event handler script. See http://nagios.sourceforge.net/docs/2_0/eventhandlers.html on how to distinguish between the various states within the script. However, I suppose you _don't_ want to use an event_handler for plain e-mail notifications? So you can then just use the usual configuration directives for them, ignoring the trouble ticket stuff which is handled by your event_handler. No? > this also meant I would have to (re)introduce another couple of > hundred service definitions that only would differ in their > service_description to be referenced accordingly by their respective > serviceescalation definition. Note that you can group services using a servicegroup definition which can then be referenced in the serviceescalation definition. Holger -- PGP fingerprint: F1F0 9071 8084 A426 DD59 9839 59D3 F3A1 B8B5 D3DE ------------------------------------------------------- SF.Net email is sponsored by: Tame your development challenges with Apache's Geronimo App Server. Download it for free - -and be entered to win a 42" plasma tv or your very own Sony(tm)PSP. Click here to play: http://sourceforge.net/geronimo.php _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From trevorwarren at gmail.com Wed Sep 14 19:31:01 2005 From: trevorwarren at gmail.com (Trevor Warren) Date: Wed, 14 Sep 2005 23:01:01 +0530 Subject: Load graph In-Reply-To: <20050914152327.82853.qmail@web53906.mail.yahoo.com> References: <20050914152327.82853.qmail@web53906.mail.yahoo.com> Message-ID: <559e3cb60509141031393b6c9d@mail.gmail.com> Hello Ravi, Have got some scripts written and i can send you the same. But only the day after since i am going to be out of office. Thanks and take care. Trevor On 9/14/05, Ravi Kumar wrote: > > I want to plot graph on load monitoring. > I'm using rrd to create graph but the graph didn't show details. > > *rrdtool create ta2-prod-load.rrd -s 60 DS:load:GAUGE:0:300:U RRA:AVERAGE: > 0.5:1:50400 RRA:AVERA* > > *E:0.5:60:43800* > > ** > > *please help what is wrong in above command,* > > *thanks* > > ** > > ------------------------------ > Yahoo! for Good > Click here to donate to the > Hurricane Katrina relief effort. > > -- ___________________________________ ( >- / Scaling FreeSoftware & OpenSource \ -< ) /~\ / In the Enterprise \ /~\ | \) \ | www.fsf.org | www.opensource.org| / (/ | |_|_ \____________________________________/ _|_| An eye for an eye will soon turn the world blind - MKG -------------- next part -------------- An HTML attachment was scrubbed... URL: From izotov at list.ru Wed Sep 14 19:57:18 2005 From: izotov at list.ru (Izotov Igor) Date: Wed, 14 Sep 2005 21:57:18 +0400 Subject: cluster_check sends no notifications Message-ID: Hello, everyone! The problem is that I do not receive any notifications from CLUSTER_CHECK service. Here's conf file part define service{ host_name host service_description CLUSTER_CHECK is_volatile 0 check_period 24x7 max_check_attempts 1 normal_check_interval 10 retry_check_interval 1 contact_groups admins notification_interval 0 notification_period 24x7 check_command check_service_cluster!"host check by oedipus and igori_virtual"!2!2!$SERVICESTATEID:host:PING_HOST_BY_IGORI_VIRTUAL$,$SERVICE STATEID:host:PING_HOST_BY_OEDIPUS$ active_checks_enabled 1 passive_checks_enabled 0 parallelize_check 0 check_freshness 0 notifications_enabled 1 } PING_HOST_BY_IGORI_VIRTUAL (via nrpe) service and PING_HOST_BY_OEDIPUS (just straight ping) return CRITICAL states, but notifications_enabled = 0 for them. CGI interface says CLUSTER_CHECK is CRITICAL too, just as it should be, but... no notifications are sent. define contactgroup{ contactgroup_name admins alias Nagios Administrators members nagios-admin } define contact{ contact_name nagios-admin alias Nagios Admin service_notification_period 24x7 host_notification_period 24x7 service_notification_options w,u,c,r host_notification_options d,r service_notification_commands service-notify-by-email host_notification_commands host-notify-by-email email izotov at list.ru } The strange thing is that when following service goes into CRITICAL STATE, notifications are sent! define service{ host_name igori_virtual service_description PING_MONITORING_SERVER_IGORI_VIRTUAL is_volatile 0 check_period 24x7 max_check_attempts 1 normal_check_interval 5 retry_check_interval 1 contact_groups admins notification_interval 0 notification_period 24x7 check_command check_host_alive active_checks_enabled 1 passive_checks_enabled 0 parallelize_check 1 check_freshness 0 notifications_enabled 1 } Quite a strange stuff. Anyone got ideas? Thank you. ------------------------------------------------------- SF.Net email is sponsored by: Tame your development challenges with Apache's Geronimo App Server. Download it for free - -and be entered to win a 42" plasma tv or your very own Sony(tm)PSP. Click here to play: http://sourceforge.net/geronimo.php _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From james.mohr at elaxy.com Wed Sep 14 22:16:16 2005 From: james.mohr at elaxy.com (Mohr James) Date: Wed, 14 Sep 2005 22:16:16 +0200 Subject: Nagios plugin to copy large text files Message-ID: Hi All! I was wondering if there was a Nagios plug-in that was able to "copy" large block of text from the agent to the server. This is related to the "NRPE Character limitation" post I made, but I was thinking of a alternate solution. All we are really interested in is getting text files (i.e. /var/log/messages) from the clients to the server. For various reasons, rsync, ftp, and so forth are out of the question, so we need a different solution. The first thought was nrpe, but we have the problem of being able to only send 350 characters. If there is no plug-in, does someone have any ideas or suggestions how we can solve this problem? Any and all help is greatly appreciated. Regards, Jim Mohr ------------------------------------------------------- SF.Net email is sponsored by: Tame your development challenges with Apache's Geronimo App Server. Download it for free - -and be entered to win a 42" plasma tv or your very own Sony(tm)PSP. Click here to play: http://sourceforge.net/geronimo.php _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jason at shakabuku.org Wed Sep 14 22:21:04 2005 From: jason at shakabuku.org (Jason Bodnar) Date: Wed, 14 Sep 2005 15:21:04 -0500 Subject: Nagios plugin to copy large text files In-Reply-To: References: Message-ID: <20050914202000.M30661@shakabuku.org> What about the check_by_ssh plugin? On Wed, 14 Sep 2005 22:16:16 +0200, Mohr James wrote > Hi All! > > I was wondering if there was a Nagios plug-in that was able to "copy" > large block of text from the agent to the server. This is related to > the "NRPE Character limitation" post I made, but I was thinking of a > alternate solution. All we are really interested in is getting text > files (i.e. /var/log/messages) from the clients to the server. For > various reasons, rsync, ftp, and so forth are out of the question, > so we need a different solution. The first thought was nrpe, but we > have the problem of being able to only send 350 characters. If there > is no plug-in, does someone have any ideas or suggestions how we can > solve this problem? Any and all help is greatly appreciated. > > Regards, > > Jim Mohr > > ------------------------------------------------------- > SF.Net email is sponsored by: > Tame your development challenges with Apache's Geronimo App Server. > Download it for free - -and be entered to win a 42" plasma tv or > your very own Sony(tm)PSP. Click here to play: http://sourceforge.net/geronimo.php > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. ::: Messages without supporting info will risk > being sent to /dev/null > > ** CRM114 Whitelisted by: nagios ** -- Jason Bodnar jason at shakabuku.org http://www.shakabuku.org "You want free speech? Let's see you acknowledge a man whose words make your blood boil who is standing center stage advocating at the top of his lungs that which you would spend a lifetime opposing at the top of yours." -- President Andrew Shephard, "The American President" ------------------------------------------------------- SF.Net email is sponsored by: Tame your development challenges with Apache's Geronimo App Server. Download it for free - -and be entered to win a 42" plasma tv or your very own Sony(tm)PSP. Click here to play: http://sourceforge.net/geronimo.php _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Wed Sep 14 23:14:10 2005 From: marc at ena.com (Marc Powell) Date: Wed, 14 Sep 2005 16:14:10 -0500 Subject: Nagios plugin to copy large text files Message-ID: > -----Original Message----- > From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users- > admin at lists.sourceforge.net] On Behalf Of Mohr James > Sent: Wednesday, September 14, 2005 3:16 PM > To: Nagios-users at lists.sourceforge.net > Subject: [Nagios-users] Nagios plugin to copy large text files > > Hi All! > > I was wondering if there was a Nagios plug-in that was able to "copy" > large block of text from the agent to the server. This is related to the > "NRPE Character limitation" post I made, but I was thinking of a > alternate solution. All we are really interested in is getting text > files (i.e. /var/log/messages) from the clients to > the server. For various reasons, rsync, ftp, and so forth are out of the > question, so we need a different solution. The first thought was nrpe, > but we have the problem of being able to only send 350 characters. If > there is no plug-in, does someone have any ideas or suggestions how we > can solve this problem? Any and all help is greatly appreciated. Nagios is not a file management application. NRPE isn't designed to copy files between hosts. Both are designed solely to execute plugins that return one line of text and exit with the proper exit code. Instead of framing your problem to us only as solutions you think might work, why don't you try to explain what you're trying to accomplish in more detail. Something like 'I am trying to use nagios to check|detect|v