volatile state stalking - snmp traps service notifications

Noah Leaman noah at mac.com
Thu Mar 4 06:21:36 CET 2004


I might be missing something, or going about this all the wrong way but 
here I go...

I am trying to set things up so that for each router/switch host I have 
defined (hundreds) I only need to define one Link Up/Down Trap service 
for each one. I am using SEC to correlate the traps as they are 
received so that only the important traps are ever sent to nagios (via 
external commands). So the tricky thing here as I see it is that the 
"link up events" and the "link down events" for all the various 
interfaces on any given router could occur at any time and in any 
order. This is a problem for notifications... I cannot get anymore UP 
notifications after there has been a "recovery" already. For example:

[1078358543] PROCESS_SERVICE_CHECK_RESULT;router-a;Up/Down 
Trap;2;Interface Serial5/2 is down, Keepalive failed
[1078358626] PROCESS_SERVICE_CHECK_RESULT;router-a;Up/Down 
Trap;2;Interface Serial8/3 is down, Lost Signal
[1078358742] PROCESS_SERVICE_CHECK_RESULT;router-a;Up/Down 
Trap;0;Interface Serial8/3 is up
[1078358846] PROCESS_SERVICE_CHECK_RESULT;router-a;Up/Down 
Trap;0;Interface Serial5/2 is up

I have enabled is_volatile and set stalking_options to be o,w,c,u and 
so I get the first 3 notifications, but not the 4th (I assume because 
nagios thinks the service has recovered). Here is the service 
definition:

define service {
  service_description			Up/Down Trap
  host_name				router-a
  check_command			check_dummy
  max_check_attempts		1
  active_checks_enabled		0
  passive_checks_enabled	1
  is_volatile				1
  stalking_options			o,w,c,u
  contact_groups			admins
  parallelize_check			1
  obsess_over_service		1
  check_freshness			0
  notifications_enabled		1
  event_handler_enabled		1
  flap_detection_enabled		1
  process_perf_data			1
  retain_status_information	1
  retain_nonstatus_information	1
  check_period				24x7
  normal_check_interval		5
  retry_check_interval                  1
  notification_interval			0
  notification_period			24x7
  notification_options			w,u,c,r
}

How do you all address the issue of trap monitoring when you want 
notifications for them?

-- 
Noah



-------------------------------------------------------
This SF.Net email is sponsored by: IBM Linux Tutorials
Free Linux tutorial presented by Daniel Robbins, President and CEO of
GenToo technologies. Learn everything from fundamentals to system
administration.http://ads.osdn.com/?ad_id=1470&alloc_id=3638&op=click
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null





More information about the Users mailing list