Status oscillates w/check_by_ssh. Multiple daemons?

Scott Zak ZakS at so.ct.edu
Tue Dec 28 17:08:09 CET 2004


Marc,

The other main reason for running multiple daemons is to allow us to 
distribute and compartmentalize nagios administrator privileges.  We have 
four different executable files that are gathering stats behind a single 
apache.  The person in charge of each nagios is fully able to reconfigure, 
restart and fix (or fry) anything within his domain.  Our shop has been 
doing this since NetSaint 0.0.7 and we have been running cleanly on this 
instance for about a year now.  It's only when trying to hook in this new 
solaris 9 server that we are beginning to see problems, which leads me to 
believe that the issue is of a more specific nature and not a generalized 
failure.

Is there any specific reason to believe that our configuration is 
impossible and/or contributing to the originally described problem?  To 
you it's a non-standard configuration, but for me it's all I've got.

: - )

Scott Zak

__________________


You know that Nagios provides multi-view functionality right out of the 
box with just a single process and a single set of config files, right? If 
you're running multiple daemons just for different views then you've gone 
down the wrong path. With authentication enabled users are only able to 
see hosts and services that they are authorized contacts for.

--
Marc

________________________________________
From: nagios-users-admin at lists.sourceforge.net 
[mailto:nagios-users-admin at lists.sourceforge.net] On Behalf Of Scott Zak
Sent: Tuesday, December 28, 2004 8:22 AM
To: nagios-users at lists.sourceforge.net
Subject: RE: [Nagios-users] Re: Status oscillates w/check_by_ssh.


We actually have four separate nagios daemons running to provide 
different views to persons who monitor groups of hosts and services.  They 
have parallel configurations in some aspects, but they shouldn't overlap. 
Thanks for the tip.  I'll check it out.

Scott Zak
__________________




I had this happening when I had multiple nagios processes running.  You 
should check that out.
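
A minimal sketch of that check, assuming each daemon was started as
"nagios -d <config file>". The function name is made up for illustration,
and because Nagios forks while it runs checks, the count can briefly read
higher than the number of daemons actually configured.

```shell
#!/bin/sh
# Count Nagios daemon processes in `ps` output read from stdin.
# Assumption: each daemon was started as "nagios -d <config file>".
count_nagios_daemons() {
    # The [n] keeps this grep from matching its own entry in a live ps listing.
    # grep -c prints the number of matching lines (0 if there are none).
    grep -c '[n]agios -d ' || true
}

# Typical use on the monitoring host (ps flags vary by platform):
#   ps -eo pid,args | count_nagios_daemons
```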



Scott Yem

Research Computing Services

Agilent Laboratories

________________________________________



From: nagios-users-admin at lists.sourceforge.net 
[mailto:nagios-users-admin at lists.sourceforge.net] On Behalf Of Scott Zak
Sent: Monday, December 27, 2004 1:27 PM
To: nagios-users at lists.sourceforge.net
Subject: [Nagios-users] Re: Status oscillates w/check_by_ssh.




After allowing the service checks to run for a while, I'm finding that the 
service status is oscillating between OK and UNKNOWN.  So it's working 
sometimes, and sometimes it's not.  Yikes.

It doesn't seem likely that this has anything to do with timeouts or 
missed thresholds (I'm going to try increasing them anyway).
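
One way to put numbers on the oscillation is to run the same check many
times by hand and tally the exit codes. This is only a sketch; the function
name tally_exit_codes is invented here, and the run count is arbitrary.

```shell
#!/bin/sh
# Run a command repeatedly and report how often each exit code occurs.
# Usage: tally_exit_codes COUNT COMMAND [ARGS...]
tally_exit_codes() {
    runs=$1
    shift
    i=0
    while [ "$i" -lt "$runs" ]; do
        # Discard output and detach stdin so ssh cannot swallow the loop's input.
        "$@" </dev/null >/dev/null 2>&1
        echo "$?"
        i=$((i + 1))
    done | sort -n | uniq -c
}

# Example, using the check from the original post:
#   tally_exit_codes 20 libexec/check_by_ssh -H 149.152.10.183 \
#       -l remoteuser -i /path/to/rsa/identity_key -C 'LDAP'
```

Each output line is a count followed by an exit code, so a mix of 0 and 3
would confirm that the plugin itself is returning UNKNOWN part of the time.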



__________________



It's doing what it is supposed to do -- returning zero.

That is also the return when running check_by_ssh on the nagios box.

Scott


----- Forwarded by Scott Zak/IST/CSUSO on 12/27/04 01:18 PM -----



D Brian Hendrix <dhendrix2 at csc.com>

12/27/04 12:57 PM


To:        "Scott Zak" <ZakS at sysoff.ctstateu.edu>
cc:
Subject:        Re: [Nagios-users] Status Unknown w/check_by_ssh.  Command line OK.









Scott,

When you run the command on the Sun box, what is the error code returned?

Use the following command:
> echo $?

You should get a zero (0) if successful, or a one (1) or higher if not
successful.
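
For Nagios plugins the convention is a little more specific than
success/failure: 0 is OK, 1 is WARNING, 2 is CRITICAL and 3 is UNKNOWN.
A small sketch that names the state for a given exit code (the helper
nagios_state is hypothetical, not part of any plugin package):

```shell
#!/bin/sh
# Translate a plugin exit code into the Nagios state name.
nagios_state() {
    case "$1" in
        0) echo "OK" ;;
        1) echo "WARNING" ;;
        2) echo "CRITICAL" ;;
        3) echo "UNKNOWN" ;;
        *) echo "out of range ($1)" ;;
    esac
}

# Example: run the check, then report its exit code and the matching state.
#   libexec/check_by_ssh -H 149.152.10.183 -l remoteuser \
#       -i /path/to/rsa/identity_key -C 'LDAP'
#   rc=$?; echo "exit code $rc => $(nagios_state "$rc")"
```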

Brian Hendrix
Senior System Administrator - DCI
IT/9KIDD, Baptist Hospital
2000 Church Street, Nashville, TN    37062
(615) 284-5297 work
(615) 222-1704 fax
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Whatever you do, do your work heartily,
as for the Lord rather than for men
- Colossians 3:23
~~~ ><> ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~



"Scott Zak"
<ZakS at sysoff.ctstateu.edu>
Sent by: nagios-users-admin at lists.sourceforge.net
12/27/2004 11:52 AM

To:        nagios-users at lists.sourceforge.net
cc:
Subject:        [Nagios-users] Status Unknown w/check_by_ssh.  Command line OK.









Hi,

I'm setting up check_by_ssh to invoke a script via forced command on a
remote server to check a service status.  The script returns the correct
status on the server where it lives, and when I run check_by_ssh on the
command-line, all is well.  Nagios runs the remote command (and displays
the contents of stdout on the status detail page), but the status always
comes back 'Unknown'.
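
For readers who have not seen such a script, here is a rough sketch of the
shape a forced-command check usually takes: one line of status text on
stdout, then an explicit exit code. This is not the actual script from this
setup; report_daemon, the pgrep probe and the process name slapd are all
stand-ins.

```shell
#!/bin/sh
# Hypothetical remote check: print one status line, then return a Nagios code.
# report_daemon NAME PROBE [ARGS...] runs PROBE; success means the daemon is up.
report_daemon() {
    name=$1
    shift
    if "$@" >/dev/null 2>&1; then
        echo "$name daemon is running."
        return 0    # OK
    fi
    echo "$name daemon is NOT running."
    return 2        # CRITICAL
}

# Example, with pgrep and the process name slapd as stand-ins:
#   report_daemon LDAP pgrep -x slapd
#   exit $?
```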

Command-line:
[nagios at nagios1]$ libexec/check_by_ssh -H 149.152.10.183 -l remoteuser \
    -i /path/to/rsa/identity_key -C 'LDAP'
LDAP daemon is running.

When nagios runs the service check, the same 'LDAP daemon is running.'
message appears in the status information, but the status is nevertheless
marked as hard  'UNKNOWN'.

Here's checkcommands.cfg:

define command{
command_name    check-cp-ldap
command_line    $USER1$/check_by_ssh -H $HOSTADDRESS$ -l remoteuser -i /path/to/rsa/identity_key -C 'LDAP'
}

This is the status log entry:
[1104169159] SERVICE;soluminis1;LDAP;UNKNOWN;3/3;HARD;1104169025;1104169205;ACTIVE;1;1;1;1104157840;0;UNKNOWN;3706;335959;419;530;1104165074;2;1;0;1;1;0;0.00;0;1;1;1;LDAP daemon is running.


Nagios is running on RH Linux, and the target host is Solaris 9.  Nagios'
SSH is OpenSSH and the Solaris box is running Sun_SSH_1.0, protocol
versions 1.5/2.0.  The remote script worked correctly when monitoring
services on a Solaris 8 box which was running SSH Secure Shell
(non-commercial license).  Check_by_ssh is from nagios-plugins
1.4.0alpha1, version 1.18.

What am I not seeing?  It's probably some bonehead maneuver on my part,
but has anyone else run into (and found their way around) this?

Scott Zak
Connecticut State University System





