nagios 2.9 (4/10/07) check frequency increases dynamically

Bruce Friedman bruce.friedman at infodyne.com
Wed Aug 29 22:46:57 CEST 2007


Hello,
 
Has anyone run across the check frequency increasing dynamically when
the number of passive check failures reaches a high percentage?

I have only two active checks running for a specific remote server - a
tcp port check for port 22 being open, and a custom check which uses
snmpgetbulkget to retrieve a number of items from a non-standard snmp
server.

The custom plugin takes the results of the snmpbulkget and stuffs them,
one at a time into passive checks for the given server using the nagios
external command file.  Each of the passive checks is setup to not
enable active checks, and as a check_command, it uses a simple script
which fails always (it is never to be executed).

There are approximately 100 passive checks on this server with the two
active checks.  

I have noticed that with nagios 2.9, that when a large percentage of the
passive checks fail, that Nagios is increasing the frequency of active
checks for that server (only - not all servers) by as much as a factor
of 3.5.

I have not seen documentation describing this activity, and I'm
wondering if it can be configured to not do this.

The frequency of the active checks is once a minute - here is the
service config for the two active checks:

define service{
        name                            generic-service-normal
        active_checks_enabled           1
        passive_checks_enabled          1
        parallelize_check               1
        obsess_over_service             1
        check_freshness                 0
        notifications_enabled           1
        event_handler_enabled           1
        flap_detection_enabled          1
        process_perf_data               1
        retain_status_information       1
        retain_nonstatus_information    1
        is_volatile                     0
        check_period                    24x7
        max_check_attempts              3
        normal_check_interval           1
        retry_check_interval            1
        notification_interval           120
        notification_period             24x7
        notification_options            w,u,c,r
        contact_groups                  admins
        register                        0
        }

Note that the active checks are not failing during this increased check
frequency - only the passive checks are failing, but that appears to be
increasing the rate of the active checks.  I'm stumped!  
 
--
regards,           

Bruce L. Friedman
Network & System Administration
InfoDyne Corporation
1460 Renaissance Drive
Park Ridge, IL 60068


-------------------------------------------------------------------------
This SF.net email is sponsored by: Splunk Inc.
Still grepping through log files to find problems?  Stop.
Now Search log events and configuration files using AJAX and a browser.
Download your FREE copy of Splunk now >>  http://get.splunk.com/
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null





More information about the Users mailing list