2.0rc2 avail.cgi - possible bug in reporting of HOST DOWNTIME START/END EVENTS.

Stanley.Hopcroft at Dest.gov.au Stanley.Hopcroft at Dest.gov.au
Fri Feb 3 01:46:38 CET 2006


Dear Folks,

I am writing to report a possible anomaly/bug in avail.cgi for Nagios
2.0rc2 (RPM based on Dag Wieers' package for RHEL3).

The problem is that when a host has exited a period of scheduled
downtime, the 'Host log entries' shown by avail.cgi look like

Event Start Time     Event End Time       Event Duration   Event/State Type     Event/State Information
01-02-2006 00:00:00  01-02-2006 14:27:54  0d 14h 27m 54s   HOST UP (HARD)       PING OK - Packet loss = 0%, RTA = 0.82 ms
02-02-2006 20:59:44  02-02-2006 20:59:44  0d 0h 0m 0s      HOST DOWN (HARD)     CRITICAL - Plugin timed out after 10 seconds
02-02-2006 20:59:44  02-02-2006 21:06:53  0d 0h 7m 9s      HOST DOWNTIME START  Start of scheduled downtime
02-02-2006 21:06:53  02-02-2006 22:59:44  0d 1h 52m 51s    HOST UP (HARD)       PING OK - Packet loss = 0%, RTA = 0.71 ms
02-02-2006 22:59:44  03-02-2006 11:35:39  0d 12h 35m 55s+  HOST DOWNTIME END    End of scheduled downtime

and then the next time the report is run, the last line again shows how
long it has been since the host exited downtime (i.e. now minus the
downtime end).

For example:

Event Start Time     Event End Time       Event Duration   Event/State Type     Event/State Information
01-02-2006 00:00:00  01-02-2006 14:27:54  0d 14h 27m 54s   HOST UP (HARD)       PING OK - Packet loss = 0%, RTA = 0.82 ms
02-02-2006 20:59:44  02-02-2006 20:59:44  0d 0h 0m 0s      HOST DOWN (HARD)     CRITICAL - Plugin timed out after 10 seconds
02-02-2006 20:59:44  02-02-2006 21:06:53  0d 0h 7m 9s      HOST DOWNTIME START  Start of scheduled downtime
02-02-2006 21:06:53  02-02-2006 22:59:44  0d 1h 52m 51s    HOST UP (HARD)       PING OK - Packet loss = 0%, RTA = 0.71 ms
02-02-2006 22:59:44  03-02-2006 11:41:51  0d 12h 42m 7s+   HOST DOWNTIME END    End of scheduled downtime
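
The last row's duration in both listings is consistent with "report time
minus the downtime end". A minimal Python sketch of that arithmetic
follows. The helper name format_duration is made up for illustration,
the timestamps are copied from the two listings, and reading them as
DD-MM-YYYY is an assumption based on the posting date.

```python
from datetime import datetime

# Assumed timestamp layout of the report rows: DD-MM-YYYY HH:MM:SS.
FMT = "%d-%m-%Y %H:%M:%S"

def format_duration(delta):
    """Render a timedelta in the report's style, e.g. '0d 12h 35m 55s'."""
    total = int(delta.total_seconds())
    days, rem = divmod(total, 86400)
    hours, rem = divmod(rem, 3600)
    minutes, seconds = divmod(rem, 60)
    return f"{days}d {hours}h {minutes}m {seconds}s"

# Start time of the HOST DOWNTIME END row (the same in both listings).
downtime_end = datetime.strptime("02-02-2006 22:59:44", FMT)

# End times of the last row in the first and second report runs.
first_run = datetime.strptime("03-02-2006 11:35:39", FMT)
second_run = datetime.strptime("03-02-2006 11:41:51", FMT)

print(format_duration(first_run - downtime_end))   # 0d 12h 35m 55s
print(format_duration(second_run - downtime_end))  # 0d 12h 42m 7s
```

Both results match the durations shown in the last row of each listing,
apart from the trailing "+".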

This looks a little peculiar to me. It's not a bug, but it unfortunately
violates the principle of least surprise (I don't know what I was
expecting, but ..), and for those of us who mine the host log entries
it means some code modification.
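
As a rough idea of what such a modification might look like, here is a
minimal Python sketch that parses one row and flags the open-ended one.
It assumes each row has already been flattened to a single line of text
(the CGI itself emits HTML), and that the trailing "+" on the duration
marks an entry whose end time is simply the time the report was run;
that reading is inferred from the listings, not confirmed. ROW and
parse_row are hypothetical names.

```python
import re

# Hypothetical pattern for one flattened row of the 'Host log entries' table.
ROW = re.compile(
    r"(?P<start>\d{2}-\d{2}-\d{4} \d{2}:\d{2}:\d{2})\s+"
    r"(?P<end>\d{2}-\d{2}-\d{4} \d{2}:\d{2}:\d{2})\s+"
    r"(?P<days>\d+)d (?P<hours>\d+)h (?P<minutes>\d+)m (?P<seconds>\d+)s"
    r"(?P<open>\+?)\s+"
    r"(?P<rest>.*)"
)

def parse_row(line):
    """Return a dict for one row, or None if the line does not match."""
    m = ROW.match(line)
    if m is None:
        return None
    seconds = (int(m["days"]) * 86400 + int(m["hours"]) * 3600
               + int(m["minutes"]) * 60 + int(m["seconds"]))
    return {
        "start": m["start"],
        "end": m["end"],
        "seconds": seconds,
        # Assumption: '+' means the end time is the report time, not an event.
        "open_ended": m["open"] == "+",
        "event": m["rest"].strip(),
    }

closed_row = parse_row(
    "02-02-2006 21:06:53 02-02-2006 22:59:44 0d 1h 52m 51s   "
    "HOST UP (HARD) PING OK - Packet loss = 0%, RTA = 0.71 ms"
)
open_row = parse_row(
    "02-02-2006 22:59:44 03-02-2006 11:35:39 0d 12h 35m 55s+ "
    "HOST DOWNTIME END   End of scheduled downtime"
)
print(closed_row["open_ended"], open_row["open_ended"])
```

A log-mining script could then skip rows with open_ended set, or handle
them separately, so that the growing duration of the final row does not
get counted as a fixed-length event.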

The behaviour of the CGI seems OK: the event duration is simply the
time to the last event, which seems reasonable.

Thanks for your time.

Yours sincerely.



