Nagios-users digest, Vol 1 #3100 - 19 msgs

Asaf Rapoport arapoport at telepacific.com
Thu Mar 23 21:28:20 CET 2006


Hello Everyone,

I recently upgraded to the 2.0 full release and I have problems with the
mail notification, I tried different macros but I get the same result,
Additional Info is Blank, as seen below

State: CRITICAL

Date/Time: Thu Mar 23 12:24:16 PST 2006

Additional Info:

$



$

PING OK - Packet loss = 0%, RTA = 0.30 ms 

-----Original Message-----
From: nagios-users-admin at lists.sourceforge.net
[mailto:nagios-users-admin at lists.sourceforge.net] On Behalf Of
nagios-users-request at lists.sourceforge.net
Sent: Thursday, March 23, 2006 11:26 AM
To: nagios-users at lists.sourceforge.net
Subject: Nagios-users digest, Vol 1 #3100 - 19 msgs

Send Nagios-users mailing list submissions to
	nagios-users at lists.sourceforge.net

To subscribe or unsubscribe via the World Wide Web, visit
	https://lists.sourceforge.net/lists/listinfo/nagios-users
or, via email, send a message with subject or body 'help' to
	nagios-users-request at lists.sourceforge.net

You can reach the person managing the list at
	nagios-users-admin at lists.sourceforge.net

When replying, please edit your Subject line so it is more specific
than "Re: Contents of Nagios-users digest..."


Today's Topics:

   1. NRPE and $SERVICESTATE$ (Jean Frontin)
   2. Re: SNMP WARNING (James Noyes)
   3. R: [Nagios-users] Check_ntp (Marco Borsani)
   4. Re: Acknowledge issues via e-mail (Rex Wickham)
   5. nagios irc channel available (Morten Bekkelund)
   6. Re: nagios irc channel available (Russell Adams)
   7. Re: nagios irc channel available (Morten Bekkelund)
   8. RE: NRPE and $SERVICESTATE$ (Marc Powell)
   9. RE: Check_ntp (Marc Powell)
  10. R: [Nagios-users] Check_ntp (Marco Borsani)
  11. RE: Check_ntp (Marc Powell)
  12. check_sensors (Jeffrey Montgomery)
  13. Re: check_sensors (Doug Veldhuisen)
  14. RE: Difficulty with wildcards in serviceescalation (Ivan Fetch)
  15. RE: Difficulty with wildcards in serviceescalation (Andrew Laden)
  16. Nagios 'Out Of Memory' Problems (Armistead, Raffy)
  17. RE: Nagios 'Out Of Memory' Problems (Marc Powell)
  18. incomplete information showed by status.cgi (enediel gonzalez)
  19. RE: incomplete information showed by status.cgi (Marc Powell)

--__--__--

Message: 1
Date: Thu, 23 Mar 2006 09:20:39 +0100
To: nagios-users at lists.sourceforge.net
From: Jean Frontin <frontin at irit.fr>
Subject: [Nagios-users] NRPE and $SERVICESTATE$

Hello,

I run nagios 2.0 and nrpe 2.4.

How may I do to modify the $SERVICESTATE$ value resulting from a request of 
nagios ?

I run chekc_nrpe to query nrpe on an host and I need to modify, seeing 
results of running nrpe, $SERVICESTATE$ to obtain a notification alarm

Regards, Many thanks

Jean Frontin



--__--__--

Message: 2
Date: Thu, 23 Mar 2006 01:39:48 -0700
From: James Noyes <jnoyes-nagios at retrogeeks.com>
To: nagios-users at lists.sourceforge.net
Subject: Re: [Nagios-users] SNMP WARNING
Organization: Retrogeeks

On Wed, 22 Mar 2006 05:29:48 -0500 (EST),
"Kyle Tucker" <kylet at panix.com> wrote:
> > > Am I missing something fundamental about this command ?
> > 
> > It appears that you missed the 'min:max' vs. 'max:min' thing on the
ranges,
> >  but 1) that's a common oversight, and 2) it doesn't appear to be the
real
> >  reason you're getting nothing but warnings.
> 
> James, thanks for all this detailed explanation on check_snmp. I am
> using the plugin extensively, but I have a fundamental problem with
> it being designed to give an OK if checks for warning and critical 
> ranges and are within that range. This has for me - and must for
> others - been the source of confusion and need for some brain strain. 
> What do you suppose was the thinking with this approach? The man page
> says it best.
> 
> -w, --warning=INTEGER_RANGE(s)
>     Range(s) which will not result in a WARNING status
> 
> Why not specify the *warning* range using the *warning* option that 
> *WILL* produce a *warning*? It just seems all backwards.

Well, I'm much more a user than developer (although I did finally write my
own Solaris Disksuite plugin out of frustration), and as many times as I
thought about the reason for the min:max vs. max:min behavior, I never could
come up with a plausible reason for doing it this particular way that seemed
"obvious".
About the only reasonable explanation I ever did come up with, and it's
still pretty far-fetched, was that for a majority of SNMP OIDs that can be
queried, the "OK" or "normal" condition is usually one result, while the
"error" condition has many results, giving more detail about the error.  So
I just guessed that it made more sense to specify the "OK" results, rather
than a much larger range of "not OK" results.  Like I said - far-fetched,
but it's really all I could come up with.

-- 
James Noyes
(jnoyes-nagios at retrogeeks.com)


--__--__--

Message: 3
From: "Marco Borsani" <m.borsani at it.net>
To: "'Bill Jacqmein'" <wrjacqmein at gmail.com>
Cc: "'NAGIOS'" <nagios-users at lists.sourceforge.net>
Subject: R: [Nagios-users] Check_ntp
Date: Thu, 23 Mar 2006 10:02:27 +0100
Organization: ITnet

I also tested plugins 1.4.2, but I received same error message

Marco

-}-----Messaggio originale-----
-}Da: Bill Jacqmein [mailto:wrjacqmein at gmail.com]=20
-}Inviato: mercoled=EC 22 marzo 2006 17.34
-}A: Marco Borsani
-}Cc: NAGIOS
-}Oggetto: Re: [Nagios-users] Check_ntp
-}
-}upgrade to the latest version of the plugins at=20
-}http://nagiosplug.sourceforge.net/
-}barring that
-}check that utils.pm is in the libexec directory.
-}
-}
-}On 3/22/06, Marco Borsani <m.borsani at it.net> wrote:
-}> Hi all !
-}>
-}> I'd to use check_ntp, but when I "try" to running it , I obtain:
-}> # ./check_ntp
-}> "use" not allowed in expression at ./check_ntp line 66, at=20
-}end of line=20
-}> syntax error at ./check_ntp line 66, near "libexec use utils "
-}> Execution of ./check_ntp aborted due to compilation errors.
-}>
-}> I don't know perl  :-(
-}>
-}> I am using plugins 1.3.1 and Nagios 1.3 (on a HP-UX 11.00 system).
-}>
-}> Regards
-}>
-}> Marco Borsani
-}> Unix & Monitoring System Administrator Technical Operation
-}> Tel.    +39 010 4310115
-}> Fax     +39 010 4327454
-}> E-mail: m.borsani at IT.net
-}>
-}> ITnet S.r.l. - Direzione e Coordinamento di WIND=20
-}Telecomunicazioni S.p.A.
-}> Internet Service Provider
-}> Sede legale:                     Via C.G.Viola, 48 - 00148 Roma
-}> Dir. Centrale e Amministrativa:         Via Pacinotti, 39
-}>                                      16151 Genova (Italy)
-}>
-}> http://www.it.net
-}> mailto:info at IT.net
-}> _______________________________________________________________
-}> Altre sedi ITnet:
-}> MILANO tel.: +39 02 30114900    info-milano at IT.net
-}> ROMA    tel.: +39 06 83116707    info-roma at IT.net
-}> _______________________________________________________________
-}> ITnet is associated to CIX (Commercial IP eXchange) and=20
-}RIPE ITnet is=20
-}> associated to AIIP (Associazione Italiana Internet Providers)
-}>
-}>
-}>
-}>
-}> -------------------------------------------------------
-}> This SF.Net email is sponsored by xPML, a groundbreaking scripting=20
-}> language that extends applications into web and mobile=20
-}media. Attend=20
-}> the live webcast and join the prime developer group=20
-}breaking into this new coding territory!
-}>=20
-}http://sel.as-us.falkag.net/sel?cmd=3Dlnk&kid=3D110944&bid=3D241720&dat=
=3D1216
-}> 42 _______________________________________________
-}> Nagios-users mailing list
-}> Nagios-users at lists.sourceforge.net
-}> https://lists.sourceforge.net/lists/listinfo/nagios-users
-}> ::: Please include Nagios version, plugin version (-v) and=20
-}OS when reporting any issue.
-}> ::: Messages without supporting info will risk being sent=20
-}to /dev/null
-}>
-}



--__--__--

Message: 4
To: nagios-users at lists.sourceforge.net, rex at 2020media.com
Subject: Re: [Nagios-users] Acknowledge issues via e-mail
From: Rex Wickham <rex at 2020media.com>
Date: Thu, 23 Mar 2006 13:11:03 +0100 (CET)

Hi Dany			=09



I've got Terry's script working in conjunction with NANS=20

(http://www.nagiosexchange.org/Notifications.35.0.html?&tx_netnagext_pi1[=

p_view]=3D186)

A script which aggregates alerts to cut down on "nagios storm"



Regards

REX







...[write your message here]...





- Rex Wickham (2020media)



-----------------------

This thread is located in the archive at this URL:

http://www.nagiosexchange.org/nagios-users.34.0.html?&tx_maillisttofaq_pi=

1[showUid]=3D14819

				=09


--__--__--

Message: 5
From: Morten Bekkelund <morten.bekkelund at ergo.no>
To: nagios <nagios-users at lists.sourceforge.net>
Date: Thu, 23 Mar 2006 14:33:02 +0100
Subject: [Nagios-users] nagios irc channel available

Hi!

There might be nagios-channels out there, but since I didn't find any, I
made my own. A friend of mine and me are working with nagios every day
and have one of the largest implementations registered at nagios.org. We
figured it would be nice to have a dedicated nagios-channel to discuss
options and solutions with fellow users out there.

Feel free to join in.

irc server : irc.irclink.net
port : 6667
channel : #nagios


Regards, Morten.




--__--__--

Message: 6
Date: Thu, 23 Mar 2006 08:05:40 -0600
From: Russell Adams <RLAdams at AdamsInfoServ.Com>
To: nagios <nagios-users at lists.sourceforge.net>
Subject: Re: [Nagios-users] nagios irc channel available

There's already #nagios on freenode.net. I count 40 nicks logged in
currently.

Russell

On Thu, Mar 23, 2006 at 02:33:02PM +0100, Morten Bekkelund wrote:
> Hi!
> 
> There might be nagios-channels out there, but since I didn't find any, I
> made my own. A friend of mine and me are working with nagios every day
> and have one of the largest implementations registered at nagios.org. We
> figured it would be nice to have a dedicated nagios-channel to discuss
> options and solutions with fellow users out there.
> 
> Feel free to join in.
> 
> irc server : irc.irclink.net
> port : 6667
> channel : #nagios
> 
> 
> Regards, Morten.
> 
> 
> 
> 
> -------------------------------------------------------
> This SF.Net email is sponsored by xPML, a groundbreaking scripting
language
> that extends applications into web and mobile media. Attend the live
webcast
> and join the prime developer group breaking into this new coding
territory!
> http://sel.as-us.falkag.net/sel?cmd=lnk&kid=110944&bid=241720&dat=121642
> _______________________________________________
> Nagios-users mailing list
> Nagios-users at lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/nagios-users
> ::: Please include Nagios version, plugin version (-v) and OS when
reporting any issue. 
> ::: Messages without supporting info will risk being sent to /dev/null
------------------------------------------------------------------
Russell Adams                            RLAdams at AdamsInfoServ.com

PGP Key ID:     0x1160DCB3           http://www.adamsinfoserv.com/

Fingerprint:    1723 D8CA 4280 1EC9 557F  66E8 1154 E018 1160 DCB3


--__--__--

Message: 7
Subject: Re: [Nagios-users] nagios irc channel available
From: Morten Bekkelund <morten.bekkelund at ergo.no>
To: Russell Adams <RLAdams at AdamsInfoServ.Com>
Cc: nagios <nagios-users at lists.sourceforge.net>
Date: Thu, 23 Mar 2006 15:13:34 +0100

oh bugger, why didnt I know :P
on my way there

(will still be keeping this one tho)

On Thu, 2006-03-23 at 08:05 -0600, Russell Adams wrote:
> There's already #nagios on freenode.net. I count 40 nicks logged in
currently.
> 
> Russell
> 
> On Thu, Mar 23, 2006 at 02:33:02PM +0100, Morten Bekkelund wrote:
> > Hi!
> > 
> > There might be nagios-channels out there, but since I didn't find any, I
> > made my own. A friend of mine and me are working with nagios every day
> > and have one of the largest implementations registered at nagios.org. We
> > figured it would be nice to have a dedicated nagios-channel to discuss
> > options and solutions with fellow users out there.
> > 
> > Feel free to join in.
> > 
> > irc server : irc.irclink.net
> > port : 6667
> > channel : #nagios
> > 
> > 
> > Regards, Morten.
> > 
> > 
> > 
> > 
> > -------------------------------------------------------
> > This SF.Net email is sponsored by xPML, a groundbreaking scripting
language
> > that extends applications into web and mobile media. Attend the live
webcast
> > and join the prime developer group breaking into this new coding
territory!
> > http://sel.as-us.falkag.net/sel?cmd=lnk&kid=110944&bid=241720&dat=121642
> > _______________________________________________
> > Nagios-users mailing list
> > Nagios-users at lists.sourceforge.net
> > https://lists.sourceforge.net/lists/listinfo/nagios-users
> > ::: Please include Nagios version, plugin version (-v) and OS when
reporting any issue. 
> > ::: Messages without supporting info will risk being sent to /dev/null
> ------------------------------------------------------------------
> Russell Adams                            RLAdams at AdamsInfoServ.com
> 
> PGP Key ID:     0x1160DCB3           http://www.adamsinfoserv.com/
> 
> Fingerprint:    1723 D8CA 4280 1EC9 557F  66E8 1154 E018 1160 DCB3
> 
> 
> -------------------------------------------------------
> This SF.Net email is sponsored by xPML, a groundbreaking scripting
language
> that extends applications into web and mobile media. Attend the live
webcast
> and join the prime developer group breaking into this new coding
territory!
> http://sel.as-us.falkag.net/sel?cmd=lnk&kid=110944&bid=241720&dat=121642
> _______________________________________________
> Nagios-users mailing list
> Nagios-users at lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/nagios-users
> ::: Please include Nagios version, plugin version (-v) and OS when
reporting any issue. 
> ::: Messages without supporting info will risk being sent to /dev/null


--__--__--

Message: 8
Subject: RE: [Nagios-users] NRPE and $SERVICESTATE$
Date: Thu, 23 Mar 2006 09:10:20 -0600
From: "Marc Powell" <marc at ena.com>
To: <nagios-users at lists.sourceforge.net>



> -----Original Message-----
> From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users-
> admin at lists.sourceforge.net] On Behalf Of Jean Frontin
> Sent: Thursday, March 23, 2006 2:21 AM
> To: nagios-users at lists.sourceforge.net
> Subject: [Nagios-users] NRPE and $SERVICESTATE$
>=20
> Hello,
>=20
> I run nagios 2.0 and nrpe 2.4.
>=20
> How may I do to modify the $SERVICESTATE$ value resulting from a
request
> of
> nagios ?
>=20
> I run chekc_nrpe to query nrpe on an host and I need to modify, seeing
> results of running nrpe, $SERVICESTATE$ to obtain a notification alarm

The service state is set by the plugin that is being executed by NRPE
via the plugin's exit code. NRPE simply passes that code back to nagios
unmodified. See
http://nagiosplug.sourceforge.net/developer-guidelines.html#AEN75 for
the proper plugin exit codes for each state.

--
Marc


--__--__--

Message: 9
Subject: RE: [Nagios-users] Check_ntp
Date: Thu, 23 Mar 2006 09:14:16 -0600
From: "Marc Powell" <marc at ena.com>
To: "NAGIOS" <nagios-users at lists.sourceforge.net>

Look around line 66 in the plugin and verify that it looks something =
like --

use lib "/usr/local/nagios/libexec" ;
use utils qw($TIMEOUT %ERRORS &print_revision &support);

--
Marc

> -----Original Message-----
> From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users-
> admin at lists.sourceforge.net] On Behalf Of Marco Borsani
> Sent: Thursday, March 23, 2006 3:02 AM
> To: 'Bill Jacqmein'
> Cc: 'NAGIOS'
> Subject: R: [Nagios-users] Check_ntp
>=20
> I also tested plugins 1.4.2, but I received same error message
>=20
> Marco
>=20
> -}-----Messaggio originale-----
> -}Da: Bill Jacqmein [mailto:wrjacqmein at gmail.com]
> -}Inviato: mercoled=EC 22 marzo 2006 17.34
> -}A: Marco Borsani
> -}Cc: NAGIOS
> -}Oggetto: Re: [Nagios-users] Check_ntp
> -}
> -}upgrade to the latest version of the plugins at
> -}http://nagiosplug.sourceforge.net/
> -}barring that
> -}check that utils.pm is in the libexec directory.
> -}
> -}
> -}On 3/22/06, Marco Borsani <m.borsani at it.net> wrote:
> -}> Hi all !
> -}>
> -}> I'd to use check_ntp, but when I "try" to running it , I obtain:
> -}> # ./check_ntp
> -}> "use" not allowed in expression at ./check_ntp line 66, at
> -}end of line
> -}> syntax error at ./check_ntp line 66, near "libexec use utils "
> -}> Execution of ./check_ntp aborted due to compilation errors.
> -}>
> -}> I don't know perl  :-(
> -}>
> -}> I am using plugins 1.3.1 and Nagios 1.3 (on a HP-UX 11.00 system).
> -}>
> -}> Regards
> -}>
> -}> Marco Borsani
> -}> Unix & Monitoring System Administrator Technical Operation
> -}> Tel.    +39 010 4310115
> -}> Fax     +39 010 4327454
> -}> E-mail: m.borsani at IT.net
> -}>
> -}> ITnet S.r.l. - Direzione e Coordinamento di WIND
> -}Telecomunicazioni S.p.A.
> -}> Internet Service Provider
> -}> Sede legale:                     Via C.G.Viola, 48 - 00148 Roma
> -}> Dir. Centrale e Amministrativa:         Via Pacinotti, 39
> -}>                                      16151 Genova (Italy)
> -}>
> -}> http://www.it.net
> -}> mailto:info at IT.net
> -}> _______________________________________________________________
> -}> Altre sedi ITnet:
> -}> MILANO tel.: +39 02 30114900    info-milano at IT.net
> -}> ROMA    tel.: +39 06 83116707    info-roma at IT.net
> -}> _______________________________________________________________
> -}> ITnet is associated to CIX (Commercial IP eXchange) and
> -}RIPE ITnet is
> -}> associated to AIIP (Associazione Italiana Internet Providers)
> -}>
> -}>
> -}>
> -}>
> -}> -------------------------------------------------------
> -}> This SF.Net email is sponsored by xPML, a groundbreaking scripting
> -}> language that extends applications into web and mobile
> -}media. Attend
> -}> the live webcast and join the prime developer group
> -}breaking into this new coding territory!
> -}>
> =
-}http://sel.as-us.falkag.net/sel?cmd=3Dlnk&kid=3D110944&bid=3D241720&dat=
=3D1216
> -}> 42 _______________________________________________
> -}> Nagios-users mailing list
> -}> Nagios-users at lists.sourceforge.net
> -}> https://lists.sourceforge.net/lists/listinfo/nagios-users
> -}> ::: Please include Nagios version, plugin version (-v) and
> -}OS when reporting any issue.
> -}> ::: Messages without supporting info will risk being sent
> -}to /dev/null
> -}>
> -}
>=20
>=20
>=20
> -------------------------------------------------------
> This SF.Net email is sponsored by xPML, a groundbreaking scripting
> language
> that extends applications into web and mobile media. Attend the live
> webcast
> and join the prime developer group breaking into this new coding
> territory!
> http://sel.as-us.falkag.net/sel?cmd=3Dk&kid=110944&bid$1720&dat=121642
> _______________________________________________
> Nagios-users mailing list
> Nagios-users at lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/nagios-users
> ::: Please include Nagios version, plugin version (-v) and OS when
> reporting any issue.
> ::: Messages without supporting info will risk being sent to /dev/null


--__--__--

Message: 10
From: "Marco Borsani" <m.borsani at it.net>
To: "'Marc Powell'" <marc at ena.com>,
	"'NAGIOS'" <nagios-users at lists.sourceforge.net>
Subject: R: [Nagios-users] Check_ntp
Date: Thu, 23 Mar 2006 16:23:07 +0100
Organization: ITnet

My lines were:

use lib "/usr/local/nagios/libexec" /libexec
use utils qw($TIMEOUT %ERRORS &print_revision &support);=20

Now the command answers correctly !! ;-)

The help on-line (check_ntp -h) is not so easy to understand...

Does check_ntp compare the target host specified time with the locale =
time
(Nagios server time) or what?

Regards
Marco

-}-----Messaggio originale-----
-}Da: nagios-users-admin at lists.sourceforge.net=20
-}[mailto:nagios-users-admin at lists.sourceforge.net] Per conto=20
-}di Marc Powell
-}Inviato: gioved=EC 23 marzo 2006 16.14
-}A: NAGIOS
-}Oggetto: RE: [Nagios-users] Check_ntp
-}
-}Look around line 66 in the plugin and verify that it looks=20
-}something like --
-}
-}use lib "/usr/local/nagios/libexec" ;
-}use utils qw($TIMEOUT %ERRORS &print_revision &support);
-}
-}--
-}Marc
-}
-}> -----Original Message-----
-}> From: nagios-users-admin at lists.sourceforge.net=20
-}[mailto:nagios-users-=20
-}> admin at lists.sourceforge.net] On Behalf Of Marco Borsani
-}> Sent: Thursday, March 23, 2006 3:02 AM
-}> To: 'Bill Jacqmein'
-}> Cc: 'NAGIOS'
-}> Subject: R: [Nagios-users] Check_ntp
-}>=20
-}> I also tested plugins 1.4.2, but I received same error message
-}>=20
-}> Marco
-}>=20
-}> -}-----Messaggio originale-----
-}> -}Da: Bill Jacqmein [mailto:wrjacqmein at gmail.com]
-}> -}Inviato: mercoled=EC 22 marzo 2006 17.34
-}> -}A: Marco Borsani
-}> -}Cc: NAGIOS
-}> -}Oggetto: Re: [Nagios-users] Check_ntp -} -}upgrade to the latest=20
-}> version of the plugins at -}http://nagiosplug.sourceforge.net/
-}> -}barring that
-}> -}check that utils.pm is in the libexec directory.
-}> -}
-}> -}
-}> -}On 3/22/06, Marco Borsani <m.borsani at it.net> wrote:
-}> -}> Hi all !
-}> -}>
-}> -}> I'd to use check_ntp, but when I "try" to running it , I obtain:
-}> -}> # ./check_ntp
-}> -}> "use" not allowed in expression at ./check_ntp line 66,=20
-}at -}end=20
-}> of line -}> syntax error at ./check_ntp line 66, near "libexec use=20
-}> utils "
-}> -}> Execution of ./check_ntp aborted due to compilation errors.
-}> -}>
-}> -}> I don't know perl  :-(
-}> -}>
-}> -}> I am using plugins 1.3.1 and Nagios 1.3 (on a HP-UX=20
-}11.00 system).
-}> -}>
-}> -}> Regards
-}> -}>
-}> -}> Marco Borsani
-}> -}> Unix & Monitoring System Administrator Technical Operation
-}> -}> Tel.    +39 010 4310115
-}> -}> Fax     +39 010 4327454
-}> -}> E-mail: m.borsani at IT.net
-}> -}>
-}> -}> ITnet S.r.l. - Direzione e Coordinamento di WIND=20
-}> -}Telecomunicazioni S.p.A.
-}> -}> Internet Service Provider
-}> -}> Sede legale:                     Via C.G.Viola, 48 - 00148 Roma
-}> -}> Dir. Centrale e Amministrativa:         Via Pacinotti, 39
-}> -}>                                      16151 Genova (Italy)
-}> -}>
-}> -}> http://www.it.net
-}> -}> mailto:info at IT.net
-}> -}> _______________________________________________________________
-}> -}> Altre sedi ITnet:
-}> -}> MILANO tel.: +39 02 30114900    info-milano at IT.net
-}> -}> ROMA    tel.: +39 06 83116707    info-roma at IT.net
-}> -}> _______________________________________________________________
-}> -}> ITnet is associated to CIX (Commercial IP eXchange) and -}RIPE=20
-}> ITnet is -}> associated to AIIP (Associazione Italiana Internet=20
-}> Providers) -}> -}> -}> -}> -}>=20
-}> -------------------------------------------------------
-}> -}> This SF.Net email is sponsored by xPML, a=20
-}groundbreaking scripting=20
-}> -}> language that extends applications into web and mobile -}media.=20
-}> Attend -}> the live webcast and join the prime developer group=20
-}> -}breaking into this new coding territory!
-}> -}>
-}>=20
-}-}http://sel.as-us.falkag.net/sel?cmd=3Dlnk&kid=3D110944&bid=3D241720&d=
at=3D12
-}> 16 -}> 42 _______________________________________________
-}> -}> Nagios-users mailing list
-}> -}> Nagios-users at lists.sourceforge.net
-}> -}> https://lists.sourceforge.net/lists/listinfo/nagios-users
-}> -}> ::: Please include Nagios version, plugin version (-v) and -}OS=20
-}> when reporting any issue.
-}> -}> ::: Messages without supporting info will risk being sent -}to=20
-}> /dev/null -}> -}
-}>=20
-}>=20
-}>=20
-}> -------------------------------------------------------
-}> This SF.Net email is sponsored by xPML, a groundbreaking scripting=20
-}> language that extends applications into web and mobile=20
-}media. Attend=20
-}> the live webcast and join the prime developer group=20
-}breaking into this=20
-}> new coding territory!
-}> =
http://sel.as-us.falkag.net/sel?cmd=3Dk&kid=110944&bid$1720&dat=121642
-}> _______________________________________________
-}> Nagios-users mailing list
-}> Nagios-users at lists.sourceforge.net
-}> https://lists.sourceforge.net/lists/listinfo/nagios-users
-}> ::: Please include Nagios version, plugin version (-v) and OS when=20
-}> reporting any issue.
-}> ::: Messages without supporting info will risk being sent=20
-}to /dev/null
-}
-}
-}-------------------------------------------------------
-}This SF.Net email is sponsored by xPML, a groundbreaking=20
-}scripting language that extends applications into web and=20
-}mobile media. Attend the live webcast and join the prime=20
-}developer group breaking into this new coding territory!
-}http://sel.as-us.falkag.net/sel?cmd=3Dk&kid=110944&bid$1720&dat=121642
-}_______________________________________________
-}Nagios-users mailing list
-}Nagios-users at lists.sourceforge.net
-}https://lists.sourceforge.net/lists/listinfo/nagios-users
-}::: Please include Nagios version, plugin version (-v) and OS=20
-}when reporting any issue.=20
-}::: Messages without supporting info will risk being sent to /dev/null
-}



--__--__--

Message: 11
Subject: RE: [Nagios-users] Check_ntp
Date: Thu, 23 Mar 2006 09:28:40 -0600
From: "Marc Powell" <marc at ena.com>
To: "NAGIOS" <nagios-users at lists.sourceforge.net>



> -----Original Message-----
> From: Marco Borsani [mailto:m.borsani at it.net]
> Sent: Thursday, March 23, 2006 9:23 AM
> To: Marc Powell; 'NAGIOS'
> Subject: R: [Nagios-users] Check_ntp
>=20
> My lines were:
>=20
> use lib "/usr/local/nagios/libexec" /libexec
> use utils qw($TIMEOUT %ERRORS &print_revision &support);
>=20
> Now the command answers correctly !! ;-)
>=20
> The help on-line (check_ntp -h) is not so easy to understand...
>=20
> Does check_ntp compare the target host specified time with the locale
time
> (Nagios server time) or what?

I don't use the plugin but the help says "Checks the local timestamp
offset versus <host> with ntpdate" so I would say the answer is yes.

--
Marc


--__--__--

Message: 12
Date: Thu, 23 Mar 2006 09:47:01 -0600
From: "Jeffrey Montgomery" <jrmontg at gmail.com>
To: nagios-users at lists.sourceforge.net
Subject: [Nagios-users] check_sensors

------=_Part_6150_12434221.1143128821479
Content-Type: text/plain; charset=ISO-8859-1
Content-Transfer-Encoding: quoted-printable
Content-Disposition: inline

Can anyone explain how to use the check_sensor plugin?  I installed
lm_sensors and when I run
./check_sensors 10.10.18.23 I get WARNING - sensors returned state 1.

Thanks
--
Jeff

------=_Part_6150_12434221.1143128821479
Content-Type: text/html; charset=ISO-8859-1
Content-Transfer-Encoding: quoted-printable
Content-Disposition: inline

Can anyone explain how to use the check_sensor plugin?  I installed lm=
_sensors and when I run <br clear=3D"all">./check_sensors <a href=3D"http:/=
/10.10.18.23">10.10.18.23</a> I get WARNING - sensors returned state 1.<br>=
<br>Thanks
<br>-- <br>Jeff<br>

------=_Part_6150_12434221.1143128821479--


--__--__--

Message: 13
Date: Thu, 23 Mar 2006 10:41:32 -0600
From: Doug Veldhuisen <dveldhuisen at bartlesville.oilfield.slb.com>
Subject: Re: [Nagios-users] check_sensors
To: Jeffrey Montgomery <jrmontg at gmail.com>,
nagios-users at lists.sourceforge.net


--Boundary_(ID_bA5PrlQ+ELfwrR+7UINlFA)
Content-type: text/plain; charset=us-ascii; format=flowed
Content-transfer-encoding: 7BIT


Jeffrey:

You should check to make sure that lm_sensors package is really 
running.  Quite a number of machines don't support that package.  The 
program starts then quitely dies.  Example: Dell's don't support it at all. 
(or if I'm incorrect on that someone in the group correct me and then tell 
me how to make it work)  An IBM intellistation which I thought might, won't 
either.

try a ps -ef | grep sensors

Doug


At 09:47 AM 3/23/2006, Jeffrey Montgomery wrote:
>Can anyone explain how to use the check_sensor plugin?  I installed 
>lm_sensors and when I run
>./check_sensors <http://10.10.18.23>10.10.18.23 I get WARNING - sensors 
>returned state 1.
>
>Thanks
>--
>Jeff

--Boundary_(ID_bA5PrlQ+ELfwrR+7UINlFA)
Content-type: text/html; charset=us-ascii
Content-transfer-encoding: 7BIT

<html>
<body>
<font size=3><br>
Jeffrey:<br><br>
You should check to make sure that lm_sensors package is really
running.  Quite a number of machines don't support that
package.  The program starts then quitely dies.  Example:
Dell's don't support it at all. (or if I'm incorrect on that someone in
the group correct me and then tell me how to make it work)  An IBM
intellistation which I thought might, won't either.<br><br>
try a ps -ef | grep sensors<br><br>
Doug<br><br>
<br>
At 09:47 AM 3/23/2006, Jeffrey Montgomery wrote:<br>
<blockquote type=cite class=cite cite="">Can anyone explain how to use
the check_sensor plugin?  I installed lm_sensors and when I run
<br>
./check_sensors <a href="http://10.10.18.23">10.10.18.23</a> I get
WARNING - sensors returned state 1.<br><br>
Thanks <br>
-- <br>
Jeff</font></blockquote></body>
</html>

--Boundary_(ID_bA5PrlQ+ELfwrR+7UINlFA)--


--__--__--

Message: 14
Date: Thu, 23 Mar 2006 10:37:26 -0700 (MST)
From: Ivan Fetch <ifetch at du.edu>
Subject: RE: [Nagios-users] Difficulty with wildcards in serviceescalation
To: Andrew Laden <Andrew.Laden at tudor.com>
Cc: nagios-users at lists.sourceforge.net

Hello,

    Andrew, I wanted to thank you for your reply, and mention that yes this 
does work as I wanted.

    I am curious what setting use_regexp_matching to 1, but 
use_true_regexp_matching to 0 would bring to the table (given that the 
docs apparently mis-state these options).


Thanks - Ivan.
  On Tue, 21 Feb 2006, Andrew 
Laden wrote:

> There is a documentation error. You have to use real regex. A naked * by
> itself is not a valid regex.
>
> Try .*
>
> -Andrew
>
>> -----Original Message-----
>> From: nagios-users-admin at lists.sourceforge.net
>> [mailto:nagios-users-admin at lists.sourceforge.net] On Behalf
>> Of Ivan Fetch
>> Sent: Monday, February 20, 2006 5:25 PM
>> To: nagios-users at lists.sourceforge.net
>> Subject: [Nagios-users] Difficulty with wildcards in serviceescalation
>>
>> Hello,
>>
>>
>>     I'm having difficulties with wildcards in a
>> serviceescalation, running nagios 2.0-stable.  It seems like
>> the documentation discusses doing exactly this, so perhaps my
>> trying to get this to work last minute is causing me to
>> overlook something...  Any ideas?
>>
>>
>> # Limit notifications to a max of 5, the last one notifying everyone.
>> define serviceescalation{
>> host_name       *
>> service_description *
>> first_notification 5
>> last_notification 0
>> notification_interval 0
>> contact_groups everyone
>> }
>>
>>
>>
>>     When verifying the config I get:
>>
>> "Reading configuration data...
>>
>> Error: Could not find any host matching '*'
>> Error: Could not expand hostgroups and/or hosts specified in
>> service escalation (config file
>> '/usr/local/nagios/etc/du/services.cfg', starting on line 154)"
>>
>>
>>
>>     I have this set in /usr/local/nagios/etc/nagios.cfg:
>> use_regexp_matching=1
>> use_true_regexp_matching=0
>> # ... at the end of the file ...
>> cfg_dir=/usr/local/nagios/etc/du
>> # (which is where the services.cfg is read, which is where the
>> #    serviceescalation is defined)
>>
>>
>>
>> Thanks,
>>
>> Ivan Fetch.
>> '
>>
>>
>> -------------------------------------------------------
>> This SF.net email is sponsored by: Splunk Inc. Do you grep
>> through log files for problems?  Stop!  Download the new AJAX
>> search engine that makes searching your log files as easy as
>> surfing the  web.  DOWNLOAD SPLUNK!
>> http://sel.as-us.falkag.net/sel?cmd=lnk&kid=103432&bid=230486&
> dat=121642
>> _______________________________________________
>> Nagios-users mailing list
>> Nagios-users at lists.sourceforge.net
>> https://lists.sourceforge.net/lists/listinfo/nagios-users
>> ::: Please include Nagios version, plugin version (-v) and OS
>> when reporting any issue.
>> ::: Messages without supporting info will risk being sent to /dev/null
>>
>


--__--__--

Message: 15
From: Andrew Laden <Andrew.Laden at tudor.com>
To: Ivan Fetch <ifetch at du.edu>
Cc: nagios-users at lists.sourceforge.net
Subject: RE: [Nagios-users] Difficulty with wildcards in serviceescalation
Date: Thu, 23 Mar 2006 12:58:35 -0500

 I believe the docs are pretty much correct on this one.

Use_regexp_matching=1 will turn on regex for entries that contain a "*" or
"?" only.

Use_true_regex_matching=1 will turn it on completely, which while it sounds
neat, will cause lots of headaches.

Consider if you have a host named "mile", and another named "smiles".
If you have true_regex on, you will never be able to only reference the host
"mile" as every entry for it will match "smiles". (You would have to use
"^mile$" to be safe)

That's why is recommended to keep it off unless you really know what you are
doing.


> -----Original Message-----
> From: Ivan Fetch [mailto:ifetch at du.edu] 
> Sent: Thursday, March 23, 2006 12:37 PM
> To: Andrew Laden
> Cc: nagios-users at lists.sourceforge.net
> Subject: RE: [Nagios-users] Difficulty with wildcards in 
> serviceescalation
> 
> Hello,
> 
>     Andrew, I wanted to thank you for your reply, and mention 
> that yes this does work as I wanted.
> 
>     I am curious what setting use_regexp_matching to 1, but 
> use_true_regexp_matching to 0 would bring to the table (given 
> that the docs apparently mis-state these options).
> 
> 
> Thanks - Ivan.
>   On Tue, 21 Feb 2006, Andrew
> Laden wrote:
> 
> > There is a documentation error. You have to use real regex. 
> A naked * 
> > by itself is not a valid regex.
> >
> > Try .*
> >
> > -Andrew
> >
> >> -----Original Message-----
> >> From: nagios-users-admin at lists.sourceforge.net
> >> [mailto:nagios-users-admin at lists.sourceforge.net] On 
> Behalf Of Ivan 
> >> Fetch
> >> Sent: Monday, February 20, 2006 5:25 PM
> >> To: nagios-users at lists.sourceforge.net
> >> Subject: [Nagios-users] Difficulty with wildcards in 
> >> serviceescalation
> >>
> >> Hello,
> >>
> >>
> >>     I'm having difficulties with wildcards in a serviceescalation, 
> >> running nagios 2.0-stable.  It seems like the 
> documentation discusses 
> >> doing exactly this, so perhaps my trying to get this to work last 
> >> minute is causing me to overlook something...  Any ideas?
> >>
> >>
> >> # Limit notifications to a max of 5, the last one 
> notifying everyone.
> >> define serviceescalation{
> >> host_name       *
> >> service_description *
> >> first_notification 5
> >> last_notification 0
> >> notification_interval 0
> >> contact_groups everyone
> >> }
> >>
> >>
> >>
> >>     When verifying the config I get:
> >>
> >> "Reading configuration data...
> >>
> >> Error: Could not find any host matching '*'
> >> Error: Could not expand hostgroups and/or hosts specified 
> in service 
> >> escalation (config file '/usr/local/nagios/etc/du/services.cfg', 
> >> starting on line 154)"
> >>
> >>
> >>
> >>     I have this set in /usr/local/nagios/etc/nagios.cfg:
> >> use_regexp_matching=1
> >> use_true_regexp_matching=0
> >> # ... at the end of the file ...
> >> cfg_dir=/usr/local/nagios/etc/du
> >> # (which is where the services.cfg is read, which is where the
> >> #    serviceescalation is defined)
> >>
> >>
> >>
> >> Thanks,
> >>
> >> Ivan Fetch.
> >> '
> >>
> >>
> >> -------------------------------------------------------
> >> This SF.net email is sponsored by: Splunk Inc. Do you grep through 
> >> log files for problems?  Stop!  Download the new AJAX 
> search engine 
> >> that makes searching your log files as easy as surfing the  web.  
> >> DOWNLOAD SPLUNK!
> >> http://sel.as-us.falkag.net/sel?cmd=lnk&kid=103432&bid=230486&
> > dat=121642
> >> _______________________________________________
> >> Nagios-users mailing list
> >> Nagios-users at lists.sourceforge.net
> >> https://lists.sourceforge.net/lists/listinfo/nagios-users
> >> ::: Please include Nagios version, plugin version (-v) and OS when 
> >> reporting any issue.
> >> ::: Messages without supporting info will risk being sent to 
> >> /dev/null
> >>
> >
> 


--__--__--

Message: 16
Date: Thu, 23 Mar 2006 10:23:13 -0800
From: "Armistead, Raffy" <rarmistead at datanamicsinc.com>
To: <nagios-users at lists.sourceforge.net>
Subject: [Nagios-users] Nagios 'Out Of Memory' Problems

This is a multi-part message in MIME format.

------_=_NextPart_001_01C64EA6.D8E6FE73
Content-Type: text/plain;
	charset="US-ASCII"
Content-Transfer-Encoding: quoted-printable

I have a problem with my Nagios server constantly crashing. It keeps
outputting on the screen Out of Memory errors which causes loss of
access to the server. I can ping the box but I cannot SSH or web into it
to view any information. This has been happening increasingly more
lately. Now it is about every 2-3 days that this is occurring. We have
been adding more and more devices to the servers and this problem has
been increasing as this occurs. This is how I have it set up.

=20

I have a Main Nagios server that is running the latest 2.0 (stable)
Nagios release. It is monitoring about 6800 devices but it is not
actively checking the devices. Its main role is to provide a web
interface and receive passive polls from three other servers which do
the polling. The main server also does email notifications when a device
goes down. The server sends about 30-40 emails a day. I am using NSCA
2.5 between the server and the client Nagios servers. I am only
monitoring one service for each device which is either TCP or ping
depending on the device. Mostly all devices are monitored with TCP
(roughly 6000). The rest are monitored with ping. The individual servers
are pretty evenly spread with the number of devices. They are about
2000-2500 each.

=20

All the servers are just basic computers, Dell Dimension 2400s with base
hardware. The main server was upgraded to 2GB RAM while the other
servers are running 512MB each. They are all running Celeron 2.4 GHz
processors. The individual servers are not having out of memory problems
and they are running the latest 2.0 (stable) release as well. They all
run RedHat 9.0 with everything installed for the packages.

=20

Can someone please help me in resolving this problem? Thanks.

=20

=20

=20

=20

=20

=20

The TOP process does not appear like it is running out of memory. This
is the normal output when the server has been running for a few hours.

57 processes: 54 sleeping, 3 running, 0 zombie, 0 stopped

CPU states:  41.1% user  58.8% system   0.0% nice   0.0% iowait   0.0%
idle

Mem:  2063556k av,  285940k used, 1777616k free,       0k shrd,   41056k
buff

                    177644k actv,   51688k in_d,   10892k in_c

Swap: 1044184k av,       0k used, 1044184k free                  114208k
cached

=20

=20

=20

Here is a sample configuration that I have on the devices on the main
server:

=20

hosts.cfg

define host {

name                           generic-host     ; The name of this host
template - referenced in other host definitions, used for template
recursion/resolution

notifications_enabled          1        ; Host notifications are enabled

event_handler_enabled          0        ; Host event handler is enabled

flap_detection_enabled         1        ; Flap detection is enabled

process_perf_data              1        ; Process performance data

retain_status_information      1        ; Retain status information
across program restarts

retain_nonstatus_information   1        ; Retain non-status information
across program restarts

max_check_attempts             10

notification_interval          720

notification_period            24x7

obsess_over_host               0

notification_options           d,u,r,f

register                       0        ; DONT REGISTER THIS DEFINITION
- ITS NOT A REAL HOST, JUST A TEMPLATE!

}

define host {

use                            generic-host          ; Name of host
template to use

host_name                      DETAH-R1

alias                          DETAH-R1

address                        x.x.x.x

check_command                  check_ping!200,40%!10000,100%

contact_groups                 device-admins,DETAH-admins,router-admins

}

=20

services.cfg

define service {

name                           generic-service  ; The 'name' of this
service template, referenced in other service definitions

active_checks_enabled          0        ; Active service checks are
enabled

passive_checks_enabled         1        ; Passive service checks are
enabled/accepted

parallelize_check              1        ; Active service checks should
be parallelized (disabling this can lead to major performance problems)

obsess_over_service            0        ; We should obsess over this
service (if necessary)

check_freshness                1        ; Default is to NOT check
service 'freshness'

freshness_threshold            1800

notifications_enabled          1        ; Service notifications are
enabled

event_handler_enabled          0        ; Service event handler is
enabled

flap_detection_enabled         1        ; Flap detection is enabled

process_perf_data              1        ; Process performance data

retain_status_information      1        ; Retain status information
across program restarts

retain_nonstatus_information   1        ; Retain non-status information
across program restarts

is_volatile                    0

check_period                   24x7

max_check_attempts             6

normal_check_interval          20

retry_check_interval           5

notification_interval          720

notification_period            24x7

notification_options           n

register                       0        ; DONT REGISTER THIS DEFINITION
- ITS NOT A REAL SERVICE, JUST A TEMPLATE!

}

define service {

use                            generic-service          ; Name of
service template to use

host_name                      DETAH-R1

service_description            PING

contact_groups                 device-admins,DETAH-admins,router-admins

check_command                  check_ping!200,40%!1000,100%

}

=20

Here is a sample config on the individual server.

=20

hosts.cfg

define host {

name                           generic-host     ; The name of this host
template - referenced in other host definitions, used for template
recursion/resolution

notifications_enabled          1        ; Host notifications are enabled

event_handler_enabled          0        ; Host event handler is enabled

flap_detection_enabled         1        ; Flap detection is enabled

process_perf_data              1        ; Process performance data

retain_status_information      1        ; Retain status information
across program restarts

retain_nonstatus_information   1        ; Retain non-status information
across program restarts

max_check_attempts             10

notification_interval          720

notification_period            24x7

obsess_over_host               0

notification_options           d,u,r,f

register                       0        ; DONT REGISTER THIS DEFINITION
- ITS NOT A REAL HOST, JUST A TEMPLATE!

}

define host {

use                            generic-host          ; Name of host
template to use

host_name                      DETAH-R1

alias                          DETAH-R1

address                        x.x.x.x

check_command                  check_ping!200,40%!10000,100%

contact_groups                 device-admins,DETAH-admins,router-admins

}

=20

services.cfg

define service {

name                           generic-service  ; The 'name' of this
service template, referenced in other service definitions

active_checks_enabled          1        ; Active service checks are
enabled

passive_checks_enabled         1        ; Passive service checks are
enabled/accepted

parallelize_check              1        ; Active service checks should
be parallelized (disabling this can lead to major performance problems)

obsess_over_service            1        ; We should obsess over this
service (if necessary)

check_freshness                1        ; Default is to NOT check
service 'freshness'

freshness_threshold            1800

notifications_enabled          1        ; Service notifications are
enabled

event_handler_enabled          0        ; Service event handler is
enabled

flap_detection_enabled         1        ; Flap detection is enabled

process_perf_data              1        ; Process performance data

retain_status_information      1        ; Retain status information
across program restarts

retain_nonstatus_information   1        ; Retain non-status information
across program restarts

is_volatile                    0

check_period                   24x7

max_check_attempts             6

normal_check_interval          20

retry_check_interval           5

notification_interval          720

notification_period            24x7

notification_options           n

register                       0        ; DONT REGISTER THIS DEFINITION
- ITS NOT A REAL SERVICE, JUST A TEMPLATE!

}

define service {

use                            generic-service          ; Name of
service template to use

host_name                      DETAH-R1

service_description            PING

contact_groups                 device-admins,DETAH-admins,router-admins

check_command                  check_ping!200,40%!1000,100%

}

=20

Raffy

=20


------_=_NextPart_001_01C64EA6.D8E6FE73
Content-Type: text/html;
	charset="US-ASCII"
Content-Transfer-Encoding: quoted-printable

<html xmlns:o=3D"urn:schemas-microsoft-com:office:office" =
xmlns:w=3D"urn:schemas-microsoft-com:office:word" =
xmlns:st1=3D"urn:schemas-microsoft-com:office:smarttags" =
xmlns=3D"http://www.w3.org/TR/REC-html40">

<head>
<meta http-equiv=3DContent-Type content=3D"text/html; =
charset=3Dus-ascii">
<meta name=3DGenerator content=3D"Microsoft Word 11 (filtered medium)">
<o:SmartTagType =
namespaceuri=3D"urn:schemas-microsoft-com:office:smarttags"
 name=3D"place"/>
<!--[if !mso]>
<style>
st1\:*{behavior:url(#default#ieooui) }
</style>
<![endif]-->
<style>
<!--
 /* Style Definitions */
 p.MsoNormal, li.MsoNormal, div.MsoNormal
	{margin:0in;
	margin-bottom:.0001pt;
	font-size:12.0pt;
	font-family:"Times New Roman";}
a:link, span.MsoHyperlink
	{color:blue;
	text-decoration:underline;}
a:visited, span.MsoHyperlinkFollowed
	{color:purple;
	text-decoration:underline;}
span.EmailStyle17
	{mso-style-type:personal-compose;
	font-family:Arial;
	color:windowtext;}
@page Section1
	{size:8.5in 11.0in;
	margin:1.0in 1.25in 1.0in 1.25in;}
div.Section1
	{page:Section1;}
-->
</style>

</head>

<body lang=3DEN-US link=3Dblue vlink=3Dpurple>

<div class=3DSection1>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>I have a problem with my Nagios server constantly =
crashing.
It keeps outputting on the screen Out of Memory errors which causes loss =
of
access to the server. I can ping the box but I cannot SSH or web into it =
to
view any information. This has been happening increasingly more lately. =
Now it
is about every 2-3 days that this is occurring. We have been adding more =
and
more devices to the servers and this problem has been increasing as this
occurs. This is how I have it set up.<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'><o:p> </o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>I have a Main Nagios server that is running the =
latest 2.0
(stable) Nagios release. It is monitoring about 6800 devices but it is =
not
actively checking the devices. Its main role is to provide a web =
interface and
receive passive polls from three other servers which do the polling. The =
main
server also does email notifications when a device goes down. The server =
sends
about 30-40 emails a day. I am using NSCA 2.5 between the server and the =
client
Nagios servers. I am only monitoring one service for each device which =
is
either TCP or ping depending on the device. Mostly all devices are =
monitored
with TCP (roughly 6000). The rest are monitored with ping. The =
individual
servers are pretty evenly spread with the number of devices. They are =
about
2000-2500 each.<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'><o:p> </o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>All the servers are just basic computers, Dell =
Dimension
2400s with base hardware. The main server was upgraded to 2GB RAM while =
the
other servers are running 512MB each. They are all running Celeron 2.4 =
GHz
processors. The individual servers are not having out of memory problems =
and
they are running the latest 2.0 (stable) release as well. They all run =
RedHat
9.0 with everything installed for the =
packages.<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'><o:p> </o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>Can someone please help me in resolving this problem?
Thanks.<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'><o:p> </o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'><o:p> </o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'><o:p> </o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'><o:p> </o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'><o:p> </o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'><o:p> </o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>The TOP process does not appear like it is running =
out of
memory. This is the normal output when the server has been running for a =
few
hours.<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>57 processes: 54 sleeping, 3 running, 0 zombie, 0 =
stopped<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>CPU states:  41.1% user  58.8% =
system  
0.0% nice   0.0% iowait   0.0% =
idle<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>Mem:  2063556k av,  285940k used, 1777616k =
free,      
0k shrd,   41056k buff<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>         =
          
177644k actv,   51688k in_d,   10892k =
in_c<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>Swap: 1044184k =
av,       0k
used, 1044184k
free           &nb=
sp;     
114208k cached<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'><o:p> </o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'><o:p> </o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'><o:p> </o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>Here is a sample configuration that I have on the =
devices on
the main server:<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'><o:p> </o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>hosts.cfg<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>define host {<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>name        &n=
bsp;           &nb=
sp;     
generic-host     ; The name of this host template -
referenced in other host definitions, used for template =
recursion/resolution<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>notifications_enabled     &nb=
sp;   
1        ; Host notifications are =
enabled<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>event_handler_enabled     &nb=
sp;   
0        ; Host event handler is =
enabled<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>flap_detection_enabled     &n=
bsp;  
1        ; Flap detection is =
enabled<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>process_perf_data      &=
nbsp;      
1        ; Process performance =
data<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>retain_status_information    &nbsp=
;
1        ; Retain status information =
across
program restarts<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>retain_nonstatus_information  
1        ; Retain non-status =
information
across program restarts<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>max_check_attempts      =
      
10<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>notification_interval     &nb=
sp;   
720<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>notification_period     &nbsp=
;     
24x7<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>obsess_over_host      &n=
bsp;       
0<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>notification_options     &nbs=
p;    
d,u,r,f<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>register       &nbs=
p;           &nbsp=
;  
0        ; DONT REGISTER THIS =
DEFINITION -
ITS NOT A REAL HOST, JUST A TEMPLATE!<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>}<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>define host {<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>use        &nb=
sp;           &nbs=
p;      
generic-host          ; =
Name of
host template to use<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>host_name       &nb=
sp;           &nbs=
p; 
DETAH-R1<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>alias        &=
nbsp;           &n=
bsp;    
DETAH-R1<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>address       &nbsp=
;            =
   
x.x.x.x<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>check_command      &nbsp=
;          
check_ping!200,40%!10000,100%<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>contact_groups      &nbs=
p;         
device-admins,DETAH-admins,router-admins<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>}<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'><o:p> </o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>services.cfg<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>define service {<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>name        &n=
bsp;           &nb=
sp;     
generic-service  ; The 'name' of this service template, referenced =
in
other service definitions<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>active_checks_enabled     &nb=
sp;   
0        ; Active service checks are =
enabled<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>passive_checks_enabled     &n=
bsp;  
1        ; Passive service checks are
enabled/accepted<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>parallelize_check      &=
nbsp;      
1        ; Active service checks =
should be
parallelized (disabling this can lead to major performance =
problems)<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>obsess_over_service     &nbsp=
; 
    0        ; We =
should
obsess over this service (if necessary)<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>check_freshness      &nb=
sp;        
1        ; Default is to NOT check =
service
'freshness'<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>freshness_threshold     &nbsp=
;     
1800<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>notifications_enabled     &nb=
sp;   
1        ; Service notifications are =
enabled<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>event_handler_enabled     &nb=
sp;   
0        ; Service event handler is =
enabled<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>flap_detection_enabled     &n=
bsp;  
1        ; Flap detection is =
enabled<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>process_perf_data      &=
nbsp;      
1        ; Process performance =
data<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>retain_status_information    &nbsp=
;
1        ; Retain status information =
across
program restarts<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>retain_nonstatus_information  
1        ; Retain non-status =
information
across program restarts<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>is_volatile       &=
nbsp;           
0<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>check_period       =
           
24x7<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>max_check_attempts      =
      
6<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>normal_check_interval     &nb=
sp;   
20<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>retry_check_interval     &nbs=
p;    
5<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>notification_interval     &nb=
sp;   
720<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>notification_period     &nbsp=
;     
24x7<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>notification_options     &nbs=
p;    
n<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>register       &nbs=
p;           &nbsp=
;  
0        ; DONT REGISTER THIS =
DEFINITION -
ITS NOT A REAL SERVICE, JUST A TEMPLATE!<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>}<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>define service {<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>use        &nb=
sp;           &nbs=
p;      
generic-service          ; =
Name of
service template to use<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>host_name       &nb=
sp;           &nbs=
p; 
DETAH-R1<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>service_description     &nbsp=
;     
<st1:place w:st=3D"on">PING</st1:place><o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>contact_groups      &nbs=
p;         
device-admins,DETAH-admins,router-admins<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>check_command      &nbsp=
;          
check_ping!200,40%!1000,100%<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>}<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'><o:p> </o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>Here is a sample config on the individual =
server.<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'><o:p> </o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>hosts.cfg<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>define host {<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>name        &n=
bsp;           &nb=
sp;     
generic-host     ; The name of this host template -
referenced in other host definitions, used for template =
recursion/resolution<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>notifications_enabled     &nb=
sp;   
1        ; Host notifications are =
enabled<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>event_handler_enabled     &nb=
sp;   
0        ; Host event handler is =
enabled<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>flap_detection_enabled     &n=
bsp;  
1        ; Flap detection is =
enabled<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>process_perf_data      &=
nbsp;      
1        ; Process performance =
data<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>retain_status_information    &nbsp=
;
1        ; Retain status information =
across
program restarts<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>retain_nonstatus_information  
1        ; Retain non-status =
information
across program restarts<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>max_check_attempts      =
      
10<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>notification_interval     &nb=
sp;   
720<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>notification_period     &nbsp=
;     
24x7<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>obsess_over_host      &n=
bsp;       
0<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>notification_options     &nbs=
p;    
d,u,r,f<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>register =
            &=
nbsp;         0  &=
nbsp;    
; DONT REGISTER THIS DEFINITION - ITS NOT A REAL HOST, JUST A =
TEMPLATE!<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>}<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>define host {<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>use        &nb=
sp;           &nbs=
p;      
generic-host          ; =
Name of
host template to use<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>host_name       &nb=
sp;           &nbs=
p; 
DETAH-R1<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>alias        =
            &=
nbsp;     DETAH-R1<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>address       &nbsp=
;            =
   
x.x.x.x<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>check_command      &nbsp=
;          
check_ping!200,40%!10000,100%<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>contact_groups      &nbs=
p;         
device-admins,DETAH-admins,router-admins<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>}<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'><o:p> </o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>services.cfg<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>define service {<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>name        &n=
bsp;           &nb=
sp;     
generic-service  ; The 'name' of this service template, referenced =
in
other service definitions<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>active_checks_enabled     &nb=
sp;   
1        ; Active service checks are =
enabled<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>passive_checks_enabled     &n=
bsp;  
1        ; Passive service checks are
enabled/accepted<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>parallelize_check      &=
nbsp;      
1        ; Active service checks =
should be
parallelized (disabling this can lead to major performance =
problems)<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>obsess_over_service     &nbsp=
;     
1        ; We should obsess over this
service (if necessary)<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>check_freshness      &nb=
sp;        
1        ; Default is to NOT check =
service
'freshness'<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>freshness_threshold     &nbsp=
;     
1800<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>notifications_enabled     &nb=
sp;   
1        ; Service notifications are =
enabled<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>event_handler_enabled     &nb=
sp;   
0        ; Service event handler is =
enabled<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>flap_detection_enabled     &n=
bsp;  
1        ; Flap detection is =
enabled<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>process_perf_data      &=
nbsp;      
1        ; Process performance =
data<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>retain_status_information    &nbsp=
;
1        ; Retain status information =
across
program restarts<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>retain_nonstatus_information  
1        ; Retain non-status =
information
across program restarts<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>is_volatile       &=
nbsp;           
0<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>check_period       =
           
24x7<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>max_check_attempts      =
      
6<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>normal_check_interval     &nb=
sp;   
20<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>retry_check_interval    =
       5<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>notification_interval     &nb=
sp;   
720<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>notification_period     &nbsp=
;     
24x7<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>notification_options     &nbs=
p;    
n<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>register       &nbs=
p;           &nbsp=
;  
0        ; DONT REGISTER THIS =
DEFINITION -
ITS NOT A REAL SERVICE, JUST A TEMPLATE!<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>}<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>define service {<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>use        &nb=
sp;           &nbs=
p;      
generic-service          ; =
Name of
service template to use<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>host_name       &nb=
sp;           &nbs=
p; 
DETAH-R1<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>service_description     &nbsp=
;     
<st1:place w:st=3D"on">PING</st1:place><o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>contact_groups      &nbs=
p;         
device-admins,DETAH-admins,router-admins<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>check_command     =
            &=
nbsp;check_ping!200,40%!1000,100%<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>}<o:p></o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'><o:p> </o:p></span></font></p>

<p class=3DMsoNormal><font size=3D2 face=3DArial><span =
style=3D'font-size:10.0pt;
font-family:Arial'>Raffy</span></font><o:p></o:p></p>

<p class=3DMsoNormal><font size=3D3 face=3D"Times New Roman"><span =
style=3D'font-size:
12.0pt'><o:p> </o:p></span></font></p>

</div>

</body>

</html>

------_=_NextPart_001_01C64EA6.D8E6FE73--


--__--__--

Message: 17
Subject: RE: [Nagios-users] Nagios 'Out Of Memory' Problems
Date: Thu, 23 Mar 2006 13:12:14 -0600
From: "Marc Powell" <marc at ena.com>
To: <nagios-users at lists.sourceforge.net>



> -----Original Message-----
> From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users-
> admin at lists.sourceforge.net] On Behalf Of Armistead, Raffy
> Sent: Thursday, March 23, 2006 12:23 PM
> To: nagios-users at lists.sourceforge.net
> Subject: [Nagios-users] Nagios 'Out Of Memory' Problems
>=20
> I have a problem with my Nagios server constantly crashing. It keeps
> outputting on the screen Out of Memory errors which causes loss of
access
> to the server. I can ping the box but I cannot SSH or web into it to
view
> any information. This has been happening increasingly more lately. Now
it
> is about every 2-3 days that this is occurring. We have been adding
more
> and more devices to the servers and this problem has been increasing
as
> this occurs. This is how I have it set up.
>=20
>=20
>=20
> I have a Main Nagios server that is running the latest 2.0 (stable)
Nagios
> release. It is monitoring about 6800 devices but it is not actively
> checking the devices. Its main role is to provide a web interface and
> receive passive polls from three other servers which do the polling.
The
> main server also does email notifications when a device goes down. The
> server sends about 30-40 emails a day. I am using NSCA 2.5 between the
> server and the client Nagios servers. I am only monitoring one service
for
> each device which is either TCP or ping depending on the device.
Mostly
> all devices are monitored with TCP (roughly 6000). The rest are
monitored
> with ping. The individual servers are pretty evenly spread with the
number
> of devices. They are about 2000-2500 each.=20
>=20
> Can someone please help me in resolving this problem? Thanks

Have you determined what process is using the memory? One of the first
steps you should take is to set appropriate ulimits for memory
utilization for that user so that it doesn't bring down the server. I
would configure nagios to monitor memory on that server then use top or
ps to identify the process(es) using the allocated memory when memory
utilization is high. That will provide better direction for
troubleshooting rather than simply that the machine is crashing due to
memory exhaustion. The nagios deamon itself isn't going to be using a
lot of RAM (10M on my box with 3400 passive services).

My somewhat unfounded guess is that perhaps nagios isn't reaping the
results from NSCA frequently enough so you're having a backlog of ncsa
processes. Each process uses just a little memory but if you have
thousands of them then it adds up. I've personally experienced this on a
machine that was experiencing disk problems. If this is the case, beyond
a hardware problem or capacity issue, I'd verify that your
command_check_interval is set to -1 to make sure that nagios is checking
the external command file as quickly as it can.

--
Marc=20


--__--__--

Message: 18
From: "enediel gonzalez" <enediel at hotmail.com>
To: nagios-users at lists.sourceforge.net
Date: Thu, 23 Mar 2006 19:20:26 +0000
Subject: [Nagios-users] incomplete information showed by status.cgi

Hello

I have nagios running on my debian box. I declared 1 host with about 20 
monitor (for testing purposes)

When I invoke status.cgi, sometimes appear all defined monitor for that host

and the rest of declared hosts, and sometimes appear a subset of monitors 
defined for the host and I don't see the rest of hosts.

I there something I can do to overpass this problem.

Thanks in advance for any help

Regards
Enediel
Linux user 300141
Debian GNU/Linux




--__--__--

Message: 19
Subject: RE: [Nagios-users] incomplete information showed by status.cgi
Date: Thu, 23 Mar 2006 13:25:07 -0600
From: "Marc Powell" <marc at ena.com>
To: <nagios-users at lists.sourceforge.net>



> -----Original Message-----
> From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users-
> admin at lists.sourceforge.net] On Behalf Of enediel gonzalez
> Sent: Thursday, March 23, 2006 1:20 PM
> To: nagios-users at lists.sourceforge.net
> Subject: [Nagios-users] incomplete information showed by status.cgi
>=20
> Hello
>=20
> I have nagios running on my debian box. I declared 1 host with about
20
> monitor (for testing purposes)
>=20
> When I invoke status.cgi, sometimes appear all defined monitor for
that
> host
> and the rest of declared hosts, and sometimes appear a subset of
monitors
> defined for the host and I don't see the rest of hosts.
>=20
> I there something I can do to overpass this problem.

Presuming nagios-1.x --
http://www.nagios.org/faqs/viewfaq.php?faq_id=3D21&expand=3Dfalse&showdes=
c=3Df
alse

Nagios-2.0 shouldn't experience this problem.

--
Marc=20



--__--__--

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users


End of Nagios-users Digest
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://www.monitoring-lists.org/archive/users/attachments/20060323/b51ae042/attachment.html>


More information about the Users mailing list