problems with distributed monitoring

Dmitriy Kirhlarov dkirhlarov at oilspace.com
Thu Feb 3 17:00:00 CET 2005


Hi!

We are trying to implement distributed monitoring scheme with snmp and passive checks, in general.

For transfering passive checks results from a client to "monitoring" nagios we use nsca without crypting (BTW, nsca-2.4 can't be compiled without libmcrypt. We had be write a patch)
Data are transfered every min.

"Monitoring" nagios has oscp_command set to script, which puts data to "to_send.dump" file. Every 30 sec other script transfers accumulated data from the "to_send.dump" file to "central" nagios via nsca over ssh-forwarded port.

All work fine on the "monitoring" nagios. All services "OK".
But, on "central" nagios most part of services are in flapping and/or unknown status.
As I can see, those services was configured as active checks on "monitoring" nagios.

My templates:
on "monitoring" nagios:
-----
define service{
        name                            generic-service
        register                        0
        is_volatile                     0
        max_check_attempts              1
        normal_check_interval           1
        retry_check_interval            1
        check_period                    24x7
        parallelize_check               1
        obsess_over_service             1
        check_freshness                 1
        event_handler_enabled           1
        flap_detection_enabled          1
        process_perf_data               1
        retain_status_information       1
        retain_nonstatus_information    1
        notification_interval           120
        notification_period             24x7
        notification_options            w,u,c,r
        notifications_enabled           0
        contact_groups                  admins
        }
define service{
        use                             generic-service
        register                        0
        name                            minfra01-active
        freshness_threshold             180
        passive_checks_enabled          1
        active_checks_enabled           1
        }

define service{
        use                             generic-service
        register                        0
        name                            minfra01-passive
        freshness_threshold             180
        passive_checks_enabled          1
        active_checks_enabled           0
        }

-----

On "central" nagios:
----
define service{
        name                            generic-service
        register                        0
        is_volatile                     0
        max_check_attempts              3
        normal_check_interval           4
        retry_check_interval            2
        check_period                    24x7
        parallelize_check               1
        obsess_over_service             1
        check_freshness                 1
        event_handler_enabled           1
        flap_detection_enabled          1
        process_perf_data               1
        retain_status_information       1
        retain_nonstatus_information    1
        notification_interval           120
        notification_period             24x7
        notification_options            w,u,c,r
        notifications_enabled           0
        contact_groups                  admins
        }
define service{
        use                             generic-service
        register                        0
        name                            minfra01-active
        passive_checks_enabled          1
        active_checks_enabled           0
        }

define service{
        use                             generic-service
        register                        0
        name                            minfra01-passive
        freshness_threshold             600
        passive_checks_enabled          1
        active_checks_enabled           0
        }
----

WBR
-- 
Dmitriy Kirhlarov
OILspace, 4, B. Spasskaya str. room 01, 107978 Moscow, Russia
P:+7 095 787 4700 F:+7 095 787 4701 E:DmitriyKirhlarov at oilspace.com
OILspace - The resource enriched - www.oilspace.com


-------------------------------------------------------
This SF.Net email is sponsored by: IntelliVIEW -- Interactive Reporting
Tool for open source databases. Create drag-&-drop reports. Save time
by over 75%! Publish reports on the web. Export to DOC, XLS, RTF, etc.
Download a FREE copy at http://www.intelliview.com/go/osdn_nl
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null





More information about the Users mailing list