SNMP checks randomly come back as unknown

Eirik Robertstad erobertstad at perimeterusa.com
Thu Sep 14 20:04:43 CEST 2006


I use the SNMP checks from http://www.manubulon.com/nagios/ and I've
seen some strange things happen.  I'm not sure if this has to do with
those checks directly or with Nagios.

Checks will just 'stop' for a while. Most of the time it's the Disk
check, but not always.  Using Nagiosgraph though the disk checks always
are missing data here and there. For the most part it seams to be
limited to Windows machines, but I've seen it happen with Linux as well.

I'll get back an error of: ERROR: Description/Type table : No response
from remote host 'xxx.xxx.xxx.xxx'.

On this example host, there are 5 other SNMP checks, out of those the
Disk check and the Virtual memory check are in Unknown.  I've been
waiting and waiting, and it's been 16 days now for the Virtual memory
check and 3 days now for the Drive C check. Nagios has been restarted
many of times and this check has never come out of an 'unknown' status.

If I force a check of this service, it will come back fine however and
continue to work for a while till it breaks again.

It seams to be random on what hosts and checks Nagios decides it no
longer can connect to, but it's always the same, it stays in this state
till you force a recheck. I've watched Nagios to be sure it's actually
running the checks, and I can see in the logs it getting executed.

With a tcpdump I can even see it sending data to the host and getting
the response back.  Yet, I can execute it by hand right before and right
after, with perfect results, and again once I force it in Nagios, it
will of course go OK and stay that way till it blows up again..... I'm
out of ideas.

Is there a difference in the way Nagios executes a normally scheduled
check and a forced re-check?

Thanks in advance.



--
The sender of this email subscribes to Perimeter Internetworking's email
anti-virus service. This email has been scanned for malicious code and is
believed to be virus free. For more information on email security please visit:
http://www.perimeterusa.com/malicious_code_defense_content.html

This communication is confidential, intended only for the named recipient(s)
above and may contain trade secrets or other information that is exempt from
disclosure under applicable law. Any use, dissemination, distribution or
copying of this communication by anyone other than the named recipient(s) is
strictly prohibited. If you have received this communication in error, please
delete the email and immediately notify our Command Center at 203-541-3444.

-------------------------------------------------------------------------
Using Tomcat but need to do more? Need to support web services, security?
Get stuff done quickly with pre-integrated technology to make your job easier
Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo
http://sel.as-us.falkag.net/sel?cmd=lnk&kid=120709&bid=263057&dat=121642
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null





More information about the Users mailing list