Strange state "UNKNOWN"

Carroll, Jim P [Contractor] jcarro10 at sprintspectrum.com
Mon Nov 11 18:41:29 CET 2002


I urge you to reconsider using NRPE, for 2 reasons:

1) Anything involving SSH will not scale well.  As the number of checks
grows, the ssh-based checks will start to impact your Nagios server.

2) When you say 40 hosts, are each and every one of them running a different
(version) of operating system?  If not, you should be able to compile once,
then install a tarball on each host.  As for the config file, I suggest you
edit a master nrpe.cfg file on one host, and as your requirements on the
clients change, you edit the master and push it out to all hosts.  This is
my approach, and it's working great.

jc

> -----Original Message-----
> From: Schindewolf, Stefan, Infraserv-Hoechst/DE
> [mailto:Stefan.Schindewolf at Infraserv.com]
> Sent: Monday, November 11, 2002 10:21 AM
> To: 'nagios-users at lists.sourceforge.net'
> Subject: [Nagios-users] Strange state "UNKNOWN"
> 
> 
> Hello Nagios Users.
> 
> I am experiencing a strange problem with the "check_by_ssh" 
> plugin. Nearly
> 3/4 of my services are check by this plugin.
> 
> However, some services go into "unknown" state and flap around between
> "unknown" and "ok".
> The message output of every "unknown" service indicate an "ok" state.
> When I hit "Reschedule" and "Force Check", the services 
> allways go into
> "OK", but later return into "unknown".
> I can not recognize a regularity, the problem seems to occur 
> accidently.
> 
> I supposed a "return code" problem. So I tried to run the 
> check in the shell
> as user "nagios". Everything is ok, message is displayed 
> correctly, even the
> return code ("echo $?" gives the return code) indicates "OK" 
> ("0" in the
> Shell).
> 
> Unfortunately Nagios does not seem to catch it. I wrote a 
> perl script, which
> executes the checks for nagios, so that nagios virtually runs a local
> command. This script executes "check_by_ssh" plus a check 
> command. Same
> Result. The return code is 0. The message output is like "OK! ..."
> Nagios says "unknown".
> Using the "real" ssh command for check execution does not 
> change anything.
> I tried to use nrpe, but to compile and configure it on up to 
> 40 hosts seems
> very time consuming.
> 
> At the moment I have no further idea, what to do and would like you to
> provide some advice.
> 
> Thanks and best regards.
> 
> 
> Stefan Schindewolf
> 
> Infraserv GmbH & Co Höchst KG
> Service Center Informationstechnologie
> D710, D-65926 Frankfurt
> Telefon: (069)305 - 83010
> Fax: (069)305 - 23549
> Mail: stefan.schindewolf at infraserv.com
> 
> 
> -------------------------------------------------------
> This sf.net email is sponsored by:ThinkGeek
> Welcome to geek heaven.
> http://thinkgeek.com/sf
> _______________________________________________
> Nagios-users mailing list
> Nagios-users at lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/nagios-users
> 


-------------------------------------------------------
This sf.net email is sponsored by:ThinkGeek
Welcome to geek heaven.
http://thinkgeek.com/sf




More information about the Users mailing list