Nagios-users digest, Vol 1 #2754 - 16 msgs

Arif Snort arif.snort at gmail.com
Mon Aug 15 01:30:30 CEST 2005


I am sorry, I am a newbie to Linux and Nagios. Can you help me with how
to install and compile Nagios with MySQL?
OK, thanks in advance.

On 8/12/05, nagios-users-request at lists.sourceforge.net
<nagios-users-request at lists.sourceforge.net> wrote:
> Send Nagios-users mailing list submissions to
> 	nagios-users at lists.sourceforge.net
> 
> To subscribe or unsubscribe via the World Wide Web, visit
> 	https://lists.sourceforge.net/lists/listinfo/nagios-users
> or, via email, send a message with subject or body 'help' to
> 	nagios-users-request at lists.sourceforge.net
> 
> You can reach the person managing the list at
> 	nagios-users-admin at lists.sourceforge.net
> 
> When replying, please edit your Subject line so it is more specific
> than "Re: Contents of Nagios-users digest..."
> 
> 
> Today's Topics:
> 
>    1. False Negatives (David Johnson)
>    2. RE: False Negatives (Hosey, Chester)
>    3. RE: False Negatives (David Johnson)
>    4. RE: False Negatives (David Johnson)
>    5. Service Went Down, No notification sent... (John McGowan)
>    6. RE: False Negatives (Marc Powell)
>    7. Re: Service Went Down, No notification sent... (Andreas Ericsson)
>    8. Re: why can't a servicegroup have no members? (Kevin Hanser)
>    9. RE: why can't a servicegroup have no members? (Marc Powell)
>   10. Re: Service Went Down, No notification sent... (John McGowan)
>   11. missing var/objects.cache (naguser at bhodisoft.com)
>   12. RE: missing var/objects.cache (Marc Powell)
>   13. RE: missing var/objects.cache (naguser at bhodisoft.com)
>   14. Disable pager alerts for Acknowledgements ? (Michael J McCafferty)
>   15. external commands audit trail (Marc Martinez)
>   16. Multiple orphaned nagios processes (Brian Murphy)
> 
> --__--__--
> 
> Message: 1
> Reply-To: <djohnson at jsatech.com>
> From: "David Johnson" <djohnson at jsatech.com>
> To: <nagios-users at lists.sourceforge.net>
> Date: Thu, 11 Aug 2005 10:20:15 -0500
> Subject: [Nagios-users] False Negatives
> 
> Long time user, first time emailer...
> 
> We have been loving nagios and we've even submitted some source code
> improvements.  However, we have a problem with false negatives.  We have
> 2 remote servers we monitor in which the SSH service is being reported
> as down, while the host is being reported as up.  Both of these servers
> are operating properly.  In addition, the host check command and service
> check commands are identical.  We have 40 other servers which operate in
> exactly the same way, yet the SSH service on these 2 is being reported
> as down.  Exact same versions of SSH on all servers also.  The only
> difference is location.
> 
> 
> 
> --__--__--
> 
> Message: 2
> Subject: RE: [Nagios-users] False Negatives
> Date: Thu, 11 Aug 2005 11:23:27 -0400
> From: "Hosey, Chester" <Chester.Hosey at gianteagle.com>
> To: <djohnson at jsatech.com>,
> 	<nagios-users at lists.sourceforge.net>
> 
> Have you run the service check plugins manually to see what it returns?
> Is it just an occasional failure?
> 
> -----Original Message-----
> From: nagios-users-admin at lists.sourceforge.net
> [mailto:nagios-users-admin at lists.sourceforge.net] On Behalf Of David
> Johnson
> Sent: Thursday, August 11, 2005 11:20 AM
> To: nagios-users at lists.sourceforge.net
> Subject: [Nagios-users] False Negatives
> 
> 
> Long time user, first time emailer...
> 
> We have been loving nagios and we've even submitted some source code
> improvements.  However, we have a problem with false negatives.  We have
> 2 remote servers we monitor in which the SSH service is being reported
> as down, while the host is being reported as up.  Both of these servers
> are operating properly.  In addition, the host check command and service
> check commands are identical.  We have 40 other servers which operate in
> exactly the same way, yet the SSH service on these 2 is being reported
> as down.  Exact same versions of SSH on all servers also.  The only
> difference is location.
> 
> 
> 
> -------------------------------------------------------
> SF.Net email is Sponsored by the Better Software Conference & EXPO
> September 19-22, 2005 * San Francisco, CA * Development Lifecycle
> Practices Agile & Plan-Driven Development * Managing Projects & Teams *
> Testing & QA Security * Process Improvement & Measurement *
> http://www.sqe.com/bsce5sf
> _______________________________________________
> Nagios-users mailing list
> Nagios-users at lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/nagios-users
> ::: Please include Nagios version, plugin version (-v) and OS when
> reporting any issue.
> ::: Messages without supporting info will risk being sent to /dev/null
> 
> 
> --__--__--
> 
> Message: 3
> Reply-To: <djohnson at jsatech.com>
> From: "David Johnson" <djohnson at jsatech.com>
> To: "Hosey, Chester" <Chester.Hosey at gianteagle.com>,
> 	<nagios-users at lists.sourceforge.net>
> Subject: RE: [Nagios-users] False Negatives
> Date: Thu, 11 Aug 2005 10:36:33 -0500
> 
> I've run them manually both through NRPE and directly on the host that
> runs them.  All returns okay.  Exactly the same as all other hosts.  I
> thought perhaps it might be time delay, so I set the timeout on my
> check_ssh command to 30 seconds (from the default of 10).  I did the
> same with NRPE.
> 
> 
> -----Original Message-----
> From: Hosey, Chester [mailto:Chester.Hosey at gianteagle.com]
> Sent: Thursday, August 11, 2005 10:23 AM
> To: djohnson at jsatech.com; nagios-users at lists.sourceforge.net
> Subject: RE: [Nagios-users] False Negatives
> 
> 
> Have you run the service check plugins manually to see what it returns?
> Is it just an occasional failure?
> 
> -----Original Message-----
> From: nagios-users-admin at lists.sourceforge.net
> [mailto:nagios-users-admin at lists.sourceforge.net] On Behalf Of David
> Johnson
> Sent: Thursday, August 11, 2005 11:20 AM
> To: nagios-users at lists.sourceforge.net
> Subject: [Nagios-users] False Negatives
> 
> 
> Long time user, first time emailer...
> 
> We have been loving nagios and we've even submitted some source code
> improvements.  However, we have a problem with false negatives.  We have
> 2 remote servers we monitor in which the SSH service is being reported
> as down, while the host is being reported as up.  Both of these servers
> are operating properly.  In addition, the host check command and service
> check commands are identical.  We have 40 other servers which operate in
> exactly the same way, yet the SSH service on these 2 is being reported
> as down.  Exact same versions of SSH on all servers also.  The only
> difference is location.
> 
> 
> 
> 
> 
> 
> --__--__--
> 
> Message: 4
> Reply-To: <djohnson at jsatech.com>
> From: "David Johnson" <djohnson at jsatech.com>
> To: "Hosey, Chester" <Chester.Hosey at gianteagle.com>,
> 	<nagios-users at lists.sourceforge.net>
> Subject: RE: [Nagios-users] False Negatives
> Date: Thu, 11 Aug 2005 10:36:33 -0500
> 
> The manual running of the plugins succeeds every time.  The nagios
> running fails every time.  I must say that I get two different failures.
>
> 1. One falsely failing server returns: "Server answer:"
> 2. The other falsely failing server returns a CHECK_NRPE timeout of 10
> seconds (which is odd because I set the timeout to 30 and did a hard
> restart).
> 
> 
> -----Original Message-----
> From: nagios-users-admin at lists.sourceforge.net
> [mailto:nagios-users-admin at lists.sourceforge.net]On Behalf Of Hosey,
> Chester
> Sent: Thursday, August 11, 2005 10:23 AM
> To: djohnson at jsatech.com; nagios-users at lists.sourceforge.net
> Subject: RE: [Nagios-users] False Negatives
> 
> 
> Have you run the service check plugins manually to see what it returns?
> Is it just an occasional failure?
> 
> -----Original Message-----
> From: nagios-users-admin at lists.sourceforge.net
> [mailto:nagios-users-admin at lists.sourceforge.net] On Behalf Of David
> Johnson
> Sent: Thursday, August 11, 2005 11:20 AM
> To: nagios-users at lists.sourceforge.net
> Subject: [Nagios-users] False Negatives
> 
> 
> Long time user, first time emailer...
> 
> We have been loving nagios and we've even submitted some source code
> improvements.  However, we have a problem with false negatives.  We have
> 2 remote servers we monitor in which the SSH service is being reported
> as down, while the host is being reported as up.  Both of these servers
> are operating properly.  In addition, the host check command and service
> check commands are identical.  We have 40 other servers which operate in
> exactly the same way, yet the SSH service on these 2 is being reported
> as down.  Exact same versions of SSH on all servers also.  The only
> difference is location.
> 
> 
> 
> 
> 
> 
> --__--__--
> 
> Message: 5
> Date: Thu, 11 Aug 2005 10:36:42 -0500
> From: John McGowan <mcgowan at lynch2.com>
> To: Nagios Users List <nagios-users at lists.sourceforge.net>
> Subject: [Nagios-users] Service Went Down, No notification sent...
> 
> I'm not sure exactly where to start on this.... last night i checked on 
> my services and noticed a service that was down and had been down for 
> about 20 minutes.  No notification was ever sent out... this is what I 
> saw in the log...
> 
> [08-10-2005 22:07:36] SERVICE ALERT: tessweb;Tessitura SeatServer;CRITICAL;HARD;1;CRITICAL - Socket timeout after 10 seconds
> 
> When I ran a test this morning this is what I saw in the event log.
> 
> [08-11-2005 10:17:39] SERVICE ALERT: tessweb;Tessitura SeatServer;CRITICAL;SOFT;1;No route to host
> [08-11-2005 10:18:08] SERVICE ALERT: tessweb;Tessitura SeatServer;CRITICAL;SOFT;2;No route to host
> [08-11-2005 10:18:38] SERVICE ALERT: tessweb;Tessitura SeatServer;CRITICAL;SOFT;3;No route to host
> [08-11-2005 10:19:08] SERVICE ALERT: tessweb;Tessitura SeatServer;CRITICAL;SOFT;4;No route to host
> [08-11-2005 10:19:38] SERVICE ALERT: tessweb;Tessitura SeatServer;CRITICAL;HARD;5;No route to host
> 
> The first thing that stood out when I saw it was the fact that the max 
> check attempts didn't seem to make a difference last night... the 
> service went critical hard on the first failure...
> 
> FYI: the particular thing that caused this failure last night was that 
> the VPN between here and there was down.
> 
> the service definition is listed below... along with its template
> 
> 
> 
> define service{
>     name                            generic-service    ; The 'name' of this service template, referenced in other service definitions
>     active_checks_enabled           1    ; Active service checks are enabled
>     passive_checks_enabled          1    ; Passive service checks are enabled/accepted
>     parallelize_check               1    ; Active service checks should be parallelized (disabling this can lead to major performance problems)
>     obsess_over_service             1    ; We should obsess over this service (if necessary)
>     check_freshness                 0    ; Default is to NOT check service 'freshness'
>     notifications_enabled           1    ; Service notifications are enabled
>     event_handler_enabled           1    ; Service event handler is enabled
>     flap_detection_enabled          1    ; Flap detection is enabled
>     process_perf_data               1    ; Process performance data
>     retain_status_information       1    ; Retain status information across program restarts
>     retain_nonstatus_information    1    ; Retain non-status information across program restarts
>     is_volatile                     0
>
>     check_period                    24x7
>     notification_period             24x7
>     notification_interval           120
>     notification_options            w,u,c,r
>     contact_groups                  admins
>     max_check_attempts              5
>     normal_check_interval           60
>     retry_check_interval            30
>
>     register                        0    ; DONT REGISTER THIS DEFINITION - ITS NOT A REAL SERVICE, JUST A TEMPLATE!
>     }
>
> # Service definition
> define service{
>     use                             generic-service    ; Name of service template to use
>     contact_groups                  csoadmins
>     host_name                       tessweb
>     service_description             Tessitura SeatServer
>     check_command                   check_http_site2_ssl!tessweb.cso.org!true!/Tessitura.asmx/WebSeatServerListening
> }
> 
> 
> 
> 
> --__--__--
> 
> Message: 6
> Subject: RE: [Nagios-users] False Negatives
> Date: Thu, 11 Aug 2005 10:52:27 -0500
> From: "Marc Powell" <marc at ena.com>
> To: <nagios-users at lists.sourceforge.net>
> 
> 
> 
> > -----Original Message-----
> > From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users-
> > admin at lists.sourceforge.net] On Behalf Of David Johnson
> > Sent: Thursday, August 11, 2005 10:37 AM
> > To: Hosey, Chester; nagios-users at lists.sourceforge.net
> > Subject: RE: [Nagios-users] False Negatives
> >
> > The manual running of the plugins succeeds every time.  The nagios running
> > fails every time.  I must say that I get two different failures.
> >
> > 1. One falsely failing server returns: "Server answer:"
> 
> Looking at the src for check_ssh (which I assume you are using), it
> makes a connection on the specified port and looks for "SSH" as the
> first 3 characters in the first line of output. If it doesn't find it
> then it prints "Server answer:" then the text that it did see. In your
> case it looks like it's either getting a blank line or nothing at all.
> I'd try that test manually using telnet several times in a row just to
> make sure that you're seeing the expected output. Perhaps also turn up
> logging for your sshd. Maybe there's some indication of the problem
> there.
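The banner test described above can be reproduced without telnet. The following is only a rough sketch of the logic as described in this message, not the actual check_ssh source; the function names, the OK message text, and the 256-byte read size are assumptions made for illustration.

```python
import socket
from typing import Tuple


def classify_banner(first_line: bytes) -> Tuple[int, str]:
    """Classify the first line a server sends: 0 = OK, 2 = CRITICAL.

    Mirrors the behaviour described above: the line must start with
    "SSH", otherwise the text that was seen is echoed back.
    """
    text = first_line.decode("ascii", errors="replace").strip()
    if text.startswith("SSH"):
        return 0, f"SSH OK - {text}"  # OK wording is hypothetical
    return 2, f"Server answer: {text}"


def read_banner(host: str, port: int = 22, timeout: float = 10.0) -> bytes:
    """Connect and return the first line the server sends (may be empty)."""
    with socket.create_connection((host, port), timeout=timeout) as sock:
        return sock.recv(256).split(b"\n", 1)[0]
```

Calling classify_banner(read_banner("some-host")) several times in a row from the Nagios server itself should show whether the banner is sometimes empty, which would match the bare "Server answer:" output.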
> 
> > 2. The other falsely failing server returns a CHECK_NRPE timeout of 10
> > seconds (which is odd because I set the timeout to 30 and did a hard
> > restart).
> 
> There are two places where the timeout could be coming from. Each plugin
> may have its own timeout value (usually -t) and nagios has a master
> timeout value in nagios.cfg (service_check_timeout). The master timeout
> will always override the plugin timeout value if it's shorter (since
> nagios has no idea what the plugin timeout might be). If -t=45 for a
> plugin but service_check_timeout is 10 then you'll still only get 10
> seconds.
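Put as arithmetic, the rule described above is that the shorter of the two limits wins. A tiny sketch, where the function name is made up for illustration and only the two timeouts named in this message are modelled:

```python
def effective_timeout(plugin_timeout: float, service_check_timeout: float) -> float:
    """Return the number of seconds a check really gets.

    plugin_timeout is the plugin's own limit (usually -t);
    service_check_timeout is the master value from nagios.cfg.
    Whichever is shorter ends the check first.
    """
    return min(plugin_timeout, service_check_timeout)
```

With the numbers from the example above, effective_timeout(45, 10) gives 10, so raising -t alone changes nothing until the master value is raised as well.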
> 
> --
> Marc
> 
> 
> --__--__--
> 
> Message: 7
> Date: Thu, 11 Aug 2005 17:58:14 +0200
> From: Andreas Ericsson <ae at op5.se>
> To: Nagios Users List <nagios-users at lists.sourceforge.net>
> Subject: Re: [Nagios-users] Service Went Down, No notification sent...
> 
> John McGowan wrote:
> > I'm not sure exactly where to start on this.... last night i checked on 
> > my services and noticed a service that was down and had been down for 
> > about 20 minutes.  No notification was ever sent out... this is what I 
> > saw in the log...
> > 
> > [08-10-2005 22:07:36] SERVICE ALERT: tessweb;Tessitura SeatServer;CRITICAL;HARD;1;CRITICAL - Socket timeout after 10 seconds
> > 
> > When I ran a test this morning this is what I saw in the event log.
> > 
> > The first thing that stood out when I saw it was the fact that the max
> > check attempts didn't seem to make a difference last night... the 
> > service went critical hard on the first failure...
> > 
> 
> This is because the host was down as well. Service checks aren't 
> reported if the host is down, and they're set to HARD immediately to 
> save the added strain of having to re-check them frequently.
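The SOFT/HARD behaviour described above, and visible in the two log excerpts, can be sketched as a small decision function. This is a simplified model for illustration only, not Nagios source; the function and parameter names are assumptions.

```python
def state_type(attempt: int, max_check_attempts: int, host_is_up: bool) -> str:
    """Return "SOFT" or "HARD" for a failing service check.

    If the host itself is down, the service goes HARD on the first
    failure (no retries). Otherwise it stays SOFT until the attempt
    counter reaches max_check_attempts.
    """
    if not host_is_up:
        return "HARD"
    return "HARD" if attempt >= max_check_attempts else "SOFT"
```

With max_check_attempts set to 5, attempts 1 through 4 come out SOFT and attempt 5 HARD while the host is up, as in the morning test; with the host down, attempt 1 is already HARD, as in the log from the night before.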
> 
> > FYI: the particular thing that caused this failure last night was that 
> > the VPN between here and there was down.
> > 
> 
> So the hostcheck couldn't possibly have succeeded then, and the host was 
> most likely in UNREACHABLE state (if you monitor the VPN tunnel in both 
> ends and have parents set up properly).
> 
> What notifications *did* you get? An unreachable for the VPN tunnel?
> 
> -- 
> Andreas Ericsson                   andreas.ericsson at op5.se
> OP5 AB                             www.op5.se
> Lead Developer
> 
> 
> --__--__--
> 
> Message: 8
> Subject: Re: [Nagios-users] why can't a servicegroup have no members?
> From: Kevin Hanser <kevin at mica.net>
> To: nagios-users at lists.sourceforge.net,
>         "Chester R. Hosey" <Chester.Hosey at gianteagle.com>
> Date: Thu, 11 Aug 2005 13:52:30 -0400
> 
> Hmmm... went back and tried it, and sure enough it's working for me.  I
> think it was complaining before because I had a servicegroup with no
> members, and also didn't have any services defined with the
> "servicegroups" directive.  Looks like one or the other is required..
> 
> I was confused because I created my servicegroup first (w/out any
> members) and then did a config check.  At that point it complained that
> the servicegroup had no members and wouldn't go.  Apparently though
> after I've added the "servicegroup" directive to some services, it's no
> longer complaining about the "members" being missing.
> 
> So I guess I need one or the other, I was only getting an error because
> I had just created the servicegroup and not added anything to it yet.
> 
> thx!
> 
> k
> 
> On Wed, 2005-08-10 at 17:28 -0400, Chester R. Hosey wrote:
> > Using 2.0b4 also:
> > 
> > define service{
> >         use                             nrpe-load-check
> >         host_name                       chet-linux-desktop
> >         servicegroups test
> > }
> > 
> > define servicegroup{
> >         servicegroup_name       test
> >         alias                   Sample Group
> > }
> > 
> > 
> > /usr/bin/nagios -v /etc/nagios/nagios.cfg liked it.
> > 
> > Chet
> > 
> > 
> > On Wed, 2005-08-10 at 17:14 -0400, Kevin Hanser wrote:
> > > I am running Nagios 2.0b4, and I'm trying to create some servicegroups
> > > for nice display and whatnot.
> > > 
> > > I can get them to work fine, as long as I specify at least one member in
> > > the servicegroups definition... However, I'd like to create a
> > > servicegroup for "router interfaces", and I don't really want to type in
> > > every description for every router interface I have on every router,
> > > specify servicegroups in the service definition and that would be
> > > perfect for me, except for one thing:  nagios won't let me create a
> > > servicegroup w/out any members.
> > > 
> > > I would like to create the servicegroup w/no members in its definition,
> > > and then use the "servicegroups" directive w/in the service definition
> > > to make each interface a member of the servicegroup.  So I guess my
> > > question is... Why is the "members" directive required when defining a
> > > servicegroup?
> > > 
> > > Thx!
> > > 
> > > k
> > > 
> > > 
> 
> 
> --__--__--
> 
> Message: 9
> Subject: RE: [Nagios-users] why can't a servicegroup have no members?
> Date: Thu, 11 Aug 2005 13:08:39 -0500
> From: "Marc Powell" <marc at ena.com>
> To: <nagios-users at lists.sourceforge.net>
> 
> 
> 
> > -----Original Message-----
> > From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users-
> > admin at lists.sourceforge.net] On Behalf Of Kevin Hanser
> > Sent: Thursday, August 11, 2005 12:53 PM
> > To: nagios-users at lists.sourceforge.net; Chester R. Hosey
> > Subject: Re: [Nagios-users] why can't a servicegroup have no members?
> >
> > Hmmm... went back and tried it, and sure enough it's working for me.  I
> > think it was complaining before because I had a servicegroup with no
> > members, and also didn't have any services defined with the
> > "servicegroups" directive.  Looks like one or the other is required..
> >
> 
> Which makes sense.
> 
> > I was confused because I created my servicegroup first (w/out any
> > members) and then did a config check.  At that point it complained that
> > the servicegroup had no members and wouldn't go.  Apparently though
> > after I've added the "servicegroup" directive to some services, it's no
> > longer complaining about the "members" being missing.
> 
> Because it has members now, a 'member' indicating a service that is part
> of the group, not specifically the 'members' directive for the
> definition. The terminology may be slightly confusing but the concept
> shouldn't be. A servicegroup (or any group) has to have members (e.g.
> services in this case) associated with it, either through the
> servicegroup members definition or through the service servicegroups
> definition. A group with no members is meaningless.
> 
> --
> Marc
> 
> 
> --__--__--
> 
> Message: 10
> Date: Thu, 11 Aug 2005 14:19:43 -0500
> From: John McGowan <mcgowan at lynch2.com>
> To: Nagios Users List <nagios-users at lists.sourceforge.net>
> Subject: Re: [Nagios-users] Service Went Down, No notification sent...
> 
> 
> 
> Andreas Ericsson wrote:
> 
> >> This is because the host was down as well. Service checks aren't 
> >> reported if the host is down, and they're set to HARD immediately to 
> >> save the added strain of having to re-check them frequently.
> >
> Ah that makes perfect sense... My problem is that I've been using hosts 
> as more of an organizational tool, just grouping services together.
> 
> > What notifications *did* you get? An unreachable for the VPN tunnel?
> 
> No notification at all, because the host wasn't set up properly
> 
> 
> --__--__--
> 
> Message: 11
> Date: Thu, 11 Aug 2005 14:15:50 -0700 (PDT)
> From: naguser at bhodisoft.com
> To: nagios-users at lists.sourceforge.net
> Subject: [Nagios-users] missing var/objects.cache
> 
> I'm setting up the web interfaces on a pair of existing distributed nodes.
> On one of them, I get this whoops message "Error: Could not read object
> configuration data!".
> 
> I compared the two nodes and on the one that is giving me the error, I'm
> missing a var/objects.cache file. The nodes have the same
> object_cache_file setting in etc/nagios.cfg and grep isn't turning up any
> other related settings that I can see.
> 
> Am I right in suspecting that this missing file is the most likely source
> of the problem? If so, how do I create that file? The error message says
> that I should "make sure you've compiled the main program and the CGIs to
> use the same object data storage options", but to the best of my knowledge
> there was no difference between the two installs.
> 
> I've tried just 'touch'ing it and reloading nagios, but no luck with that.
> It is still empty and I'm still getting the error.
> 
> Thanks,
> 
> -G_E
> 
> 
> 
> --__--__--
> 
> Message: 12
> Subject: RE: [Nagios-users] missing var/objects.cache
> Date: Thu, 11 Aug 2005 16:45:30 -0500
> From: "Marc Powell" <marc at ena.com>
> To: <nagios-users at lists.sourceforge.net>
> 
> 
> 
> > -----Original Message-----
> > From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users-
> > admin at lists.sourceforge.net] On Behalf Of naguser at bhodisoft.com
> > Sent: Thursday, August 11, 2005 4:16 PM
> > To: nagios-users at lists.sourceforge.net
> > Subject: [Nagios-users] missing var/objects.cache
> >
> > I'm setting up the web interfaces on a pair of existing distributed nodes.
> > On one of them, I get this whoops message "Error: Could not read object
> > configuration data!".
> >
> > I compared the two nodes and on the one that is giving me the error, I'm
> > missing a var/objects.cache file. The nodes have the same
> > object_cache_file setting in etc/nagios.cfg and grep isn't turning up any
> > other related settings that I can see.
> >
> > Am I right in suspecting that this missing file is the most likely source
> 
> Yes.
> 
> > of the problem? If so, how do I create that file? The error message says
> 
> You can't. Nagios creates it when the daemon is started. It's an
> optimized version of all your config files. Is nagios running? Does it
> have permissions to write to that directory?
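The two questions above (where does the setting point, and can the daemon write there) can be checked mechanically. Below is a minimal sketch, assuming the usual key=value layout of nagios.cfg; the helper names are invented for illustration, and the write test only reflects the user running the script, so it would need to be run as the nagios user to be meaningful.

```python
import os
from typing import Optional


def object_cache_path(cfg_text: str) -> Optional[str]:
    """Extract the object_cache_file value from nagios.cfg contents."""
    for line in cfg_text.splitlines():
        line = line.strip()
        if line.startswith("#") or "=" not in line:
            continue  # skip comments and non key=value lines
        key, value = line.split("=", 1)
        if key.strip() == "object_cache_file":
            return value.strip()
    return None


def cache_dir_problem(path: str) -> Optional[str]:
    """Describe why the cache file could not be created, or return None."""
    parent = os.path.dirname(path) or "."
    if not os.path.isdir(parent):
        return f"directory {parent} does not exist"
    if not os.access(parent, os.W_OK):
        return f"directory {parent} is not writable by this user"
    return None
```

Feeding the contents of etc/nagios.cfg to object_cache_path() and the result to cache_dir_problem() reports a missing or unwritable directory directly, instead of leaving it to the CGI error message.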
> 
> --
> Marc
> 
> 
> --__--__--
> 
> Message: 13
> Date: Thu, 11 Aug 2005 15:09:09 -0700 (PDT)
> Subject: RE: [Nagios-users] missing var/objects.cache
> From: naguser at bhodisoft.com
> To: nagios-users at lists.sourceforge.net
> 
> >> of the problem? If so, how do I create that file? The error message
> > says
> >
> > You can't. Nagios creates it when the daemon is started. It's an
> > optimized version of all your config files. Is nagios running? Does it
> > have permissions to write to that directory?
> 
> Thanks for the quick reply.
> 
> Turns out there WAS a difference on the object_cache_file setting after
> all; I just wasn't seeing it. My bad. For some reason it was actually
> pointing to var/nagios/objects.cache and the nagios subdirectory didn't
> exist. I fixed the cfg file and it's all good now.
>
> Don't mind me, it's just one of those days.
> 
> -G_E
> 
> 
> 
> --__--__--
> 
> Message: 14
> Date: Thu, 11 Aug 2005 16:01:24 -0700
> From: Michael J McCafferty <mike at m5computersecurity.com>
> To: nagios-users at lists.sourceforge.net
> Subject: [Nagios-users] Disable pager alerts for Acknowledgements ?
> 
> All,
> 
>    We have some alerts go to a pager that is in the NOC, for audible alerts
> (can't seem to get audible alerts to work with passive service checks, but
> audible alerts work great with active service checks and active host checks).
> My NOC people asked if I can get the alerts to only get sent to the pager when
> something goes down, not once each for down/critical, acknowledgement and
> up/recovery.
>    I notice the scripts in misccommands.cfg use the variable $NOTIFICATIONTYPE$
> included in the alert. I suppose I could write some kind of wrapper that gets
> called instead of going straight to /bin/mail, but I was hoping there was a
> better (faster, easier, more direct) way.
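The wrapper idea mentioned above could look roughly like the sketch below. It is only an illustration: the script name, the argument order (notification type, pager address, message) and the choice to use the message as the mail subject are all assumptions, and the notification command definition would have to pass $NOTIFICATIONTYPE$ as the first argument.

```python
import subprocess
import sys

# Only these notification types reach the pager; ACKNOWLEDGEMENT and
# RECOVERY are dropped on purpose.
PAGE_ON = {"PROBLEM"}


def should_page(notification_type: str) -> bool:
    """Return True if this notification type should be sent to the pager."""
    return notification_type.strip().upper() in PAGE_ON


def main(argv: list) -> int:
    """argv: [script, notification type, pager address, message]."""
    ntype, pager, message = argv[1], argv[2], argv[3]
    if should_page(ntype):
        # Hand off to /bin/mail with the message as subject and an empty body.
        subprocess.run(["/bin/mail", "-s", message, pager], input=b"", check=False)
    return 0


if __name__ == "__main__" and len(sys.argv) == 4:
    sys.exit(main(sys.argv))
```

Keeping the filter in one small function makes it easy to add or remove types later without touching the command definitions.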
> 
>    Thoughts ?
> 
> Thanks,
> Mike
> 
> -- 
> ************************************************************
> Michael J. McCafferty
> Principal, Security Engineer
> M5 Hosting
> 858-576-7325 Voice
> http://www.m5hosting.com
> ************************************************************
> 
> 
> 
> --__--__--
> 
> Message: 15
> Date: Thu, 11 Aug 2005 16:02:02 -0700
> From: Marc Martinez <lastxit at gmail.com>
> To: nagios-users at lists.sourceforge.net
> Subject: [Nagios-users] external commands audit trail
> 
> I'm hoping to get some suggestions about how to get an audit trail for
> external command submissions from the cgi interface..
> 
> all the google and list archive digging I've done has only turned up
> permissions problems and the like, so if there is an existing resource
> to address this please send me on my way..
> 
> the crux of the problem though is that while cmd.cgi does verify the
> auth credentials, the username is only recorded for comment
> submissions..
> 
> what's the best way to approach this?
> 
> thanks,
> Marc
> 
> 
> --__--__--
> 
> Message: 16
> Date: Fri, 12 Aug 2005 03:57:06 +0200 (MEST)
> From: "Brian Murphy" <brian.murphy at gmx.net>
> To: nagios-users at lists.sourceforge.net
> Subject: [Nagios-users] Multiple orphaned nagios processes
> 
> Hi There
> 
> Running nagios in a distributed setup, 2 systems carrying out active checks
> and sending results to central display node using nsca.
> 
> What we are seeing is that we end up with hundreds of nagios processes on
> the central node, enough to grind it to a stop. Over 2000 checks are being
> carried out at the checking nodes.
> 
> We set the service_reaper_frequency to 3 on both the central and the
> checking systems and still have the problem.
> 
> We had this problem on 2.0b3 and still have it on 2.0b4
> 
> I am suspecting that the processes are spawned off to process the passive
> checks, but collide writing into the pipe back to nagios (or it is full and
> EAGAIN) and end up eventually just orphaned out there.
> 
> How many checks per second should nagios be able to process? We seem to be
> writing to the logfile at 100/sec sometimes.
> 
> Do I just need to slow things down somehow? bigger poll cycles with the
> checks...
> 
> We are running a perfdata command and an ocsp_command on the central host,
> processes forked by these do not seem to be the problem.
> 
> 
> Config file bits below
> 
> status_file=/usr/local/nagios/var/status.dat
> nagios_user=nagios
> nagios_group=nagios
> check_external_commands=1
> command_check_interval=-1
> command_file=/usr/local/nagios/var/rw/nagios.cmd
> comment_file=/usr/local/nagios/var/comments.dat
> downtime_file=/usr/local/nagios/var/downtime.dat
> lock_file=/usr/local/nagios/var/nagios.lock
> temp_file=/usr/local/nagios/var/nagios.tmp
> event_broker_options=-1
> log_rotation_method=d
> log_archive_path=/usr/local/nagios/var/archives
> use_syslog=0
> log_notifications=1
> log_service_retries=1
> log_host_retries=1
> log_event_handlers=1
> log_initial_states=0
> log_external_commands=1
> log_passive_checks=1
> service_inter_check_delay_method=s
> max_service_check_spread=30
> service_interleave_factor=s
> host_inter_check_delay_method=s
> max_host_check_spread=30
> max_concurrent_checks=0
> service_reaper_frequency=3
> auto_reschedule_checks=0
> auto_rescheduling_interval=30
> auto_rescheduling_window=180
> sleep_time=0.25 
> service_check_timeout=60
> host_check_timeout=30
> event_handler_timeout=30
> notification_timeout=30
> ocsp_timeout=5
> perfdata_timeout=5
> retain_state_information=1
> state_retention_file=/usr/local/nagios/var/retention.dat
> retention_update_interval=60
> use_retained_program_state=1
> use_retained_scheduling_info=0
> interval_length=60
> use_aggressive_host_checking=0
> execute_service_checks=1
> accept_passive_service_checks=1
> execute_host_checks=1
> accept_passive_host_checks=1
> enable_notifications=1
> enable_event_handlers=1
> process_performance_data=1
> service_perfdata_command=process-service-perfdata
> obsess_over_services=1
> ocsp_command=nagios-data-logger
> check_for_orphaned_services=0
> check_service_freshness=1
> service_freshness_check_interval=60
> check_host_freshness=0
> host_freshness_check_interval=60
> aggregate_status_updates=1
> status_update_interval=15
> enable_flap_detection=0
> low_service_flap_threshold=5.0
> high_service_flap_threshold=20.0
> low_host_flap_threshold=5.0
> high_host_flap_threshold=20.0
> date_format=us
> p1_file=/usr/local/nagios/bin/p1.pl
> illegal_object_name_chars=`~!$%^&*|'"<>?,()=
> illegal_macro_output_chars=`~$&|'"<>
> use_regexp_matching=0
> use_true_regexp_matching=0
> admin_email=nagios
> admin_pager=pagenagios
> daemon_dumps_core=1
> 
> Any suggestions appreciated
> 
> Thanx
> 
> Brian
> 
> -- 
> 5 GB Mailbox, 50 FreeSMS http://www.gmx.net/de/go/promail
> +++ GMX - die erste Adresse für Mail, Message, More +++
> 
> 
> 
> --__--__--
> 
> _______________________________________________
> Nagios-users mailing list
> Nagios-users at lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/nagios-users
> 
> 
> End of Nagios-users Digest
>

