Buggy Nagios 2.x performance data?

Hugo van der Kooij hvdkooij at vanderkooij.org
Fri Apr 7 19:08:41 CEST 2006


Hi,

I have build a new system which prime task will be nagios. Nagios seems to
be running just fine and I was adding NagiosGrapher v1.5 to it to handle
the performance data. But it seems I still have issues with performance
data missing.

I started noticing after I added extra servers instead of graphing just
about anything I could check with nagios on my local server.

So I did check with tcpdump to see if the checks are done every 3 minutes
for a particular host. tcpdumps shows me every 3 minutes the right traffic
for the checks (PING + DNS):

08:05:20.144479 IP arwen.waakhond.net.32994 > ns2.mydyndns.org.domain:  20487+ A? www.vanderkooij.org. (37)
08:05:20.263711 IP ns2.mydyndns.org.domain > arwen.waakhond.net.32994:  20487*- 2/5/4 CNAME[|domain]
08:05:45.154599 IP arwen.waakhond.net > ns2.mydyndns.org: icmp 64: echo request seq 0
08:05:45.274403 IP ns2.mydyndns.org > arwen.waakhond.net: icmp 64: echo reply seq 0
08:05:46.155946 IP arwen.waakhond.net > ns2.mydyndns.org: icmp 64: echo request seq 1
08:05:46.271167 IP ns2.mydyndns.org > arwen.waakhond.net: icmp 64: echo reply seq 1
08:05:47.156796 IP arwen.waakhond.net > ns2.mydyndns.org: icmp 64: echo request seq 2
08:05:47.273669 IP ns2.mydyndns.org > arwen.waakhond.net: icmp 64: echo reply seq 2
08:05:48.158648 IP arwen.waakhond.net > ns2.mydyndns.org: icmp 64: echo request seq 3
08:05:48.278399 IP ns2.mydyndns.org > arwen.waakhond.net: icmp 64: echo reply seq 3
08:05:49.159490 IP arwen.waakhond.net > ns2.mydyndns.org: icmp 64: echo request seq 4
08:05:49.274105 IP ns2.mydyndns.org > arwen.waakhond.net: icmp 64: echo reply seq 4

The ngraph.log file show intermittant behavior consistent with the data in
the grahps.

This behaviour is restricted to the servers I have added after the server
itself. All the checks on the local machine work and are logged
consistently in the grahps.

If I babysit the machine I can see the servers/servicices get updated
regular like clockwork in the nagios web pages. There is in fact new
performance data every 3 minutes on the pages.

At this point I think we can rule out the plugins as Nagios gets the
proper data but somehow they do not seem to make it all the way.

I do appreciate some help in narrowing this one down and, if nescessary,
providing the coders with the right data so it can be solved.


Just for good measures here is a lot of details:

# uname -a
Linux arwen.waakhond.net 2.6.9-34.EL.XOS.1 #1 Sat Mar 11 06:39:32 CET 2006 i686 athlon i386 GNU/Linux

# rpm -qa|grep nagios
nagios-plugins-1.4.2-1.2.el4.rf
nagios-2.1-1.el4.rf

NagiosGrapher-1.5-1


define host {
        name                            host-template
        register                        0
        check_command                   check-host-alive
        max_check_attempts              3
        active_checks_enabled           1
#       passive_checks_enabled          1
        check_period                    24x7
        retain_status_information       1
        retain_nonstatus_information    1
        notification_interval           60
        notification_period             24x7
        notification_options            d,u,r,f
        notifications_enabled           1
}

define host {
        use                             host-template
        host_name                       arwen
        alias                           arwen.waakhond.net
        address                         192.0.2.101
        hostgroups                      waakhond-net
        contact_groups                  waakhond-net
}

define service {
        name                            service-template
        register                        0
        max_check_attempts              3
        normal_check_interval           1
        retry_check_interval            1
        active_checks_enabled           1
        passive_checks_enabled          1
        check_period                    24x7
        retain_status_information       1
        retain_nonstatus_information    1
        notification_interval           60
        notification_period             24x7
        notification_options            w,u,c,r,f
        notifications_enabled           1
}

define service {
        use                             service-template
        host_name                       arwen
        service_description             PING
        check_command                   check_ping!10,20%!20,40%
        contact_groups                  waakhond-net
}

# 'check_ping' command definition
define command{
        command_name    check_ping
        command_line    $USER1$/check_ping -H $HOSTADDRESS$ -w $ARG1$ -c
$ARG2$ -p 5
        }

define host {
        name                            mydyndns-template
        register                        0
        check_command                   check-host-alive
        max_check_attempts              1
        active_checks_enabled           1
#       passive_checks_enabled          1
        check_period                    24x7
        retain_status_information       1
        retain_nonstatus_information    1
        notification_interval           60
        notification_period             24x7
        notification_options            d,u,r,f
        notifications_enabled           1
}

define host{
        use                             mydyndns-template
        host_name                       ns2.mydyndns.org
        alias                           DNS server 2
        address                         204.13.249.82
        hostgroups                      mydyndns-org
        contact_groups                  mydyndns-org
        }

define service{
        use                             mydyndns-template
        host_name                       ns2.mydyndns.org
        service_description             DNS
        check_command                   check_dig!1!5!www.vanderkooij.org
        contact_groups                  mydyndns-org
        }

define service{
        use                             mydyndns-template
        host_name                       ns2.mydyndns.org
        service_description             PING
        check_command                   check_ping!200,20%!500,60%
        contact_groups                  mydyndns-org
}

define command{
        command_name                    check_dig
        command_line                    $USER1$/check_dig -H $HOSTADDRESS$
-w $ARG1$ -c $ARG2$ -l $ARG3$
        }


# ---
# Nagios Grapher Main Configuration
# ---

define config {
        pipe                    /var/spool/nagios/ngraph.pipe
        buffer                  1024
        interface               pipe
        step                    300
        heartbeat               600
        rrdpath                 /var/spool/nagios/rrd/
        tmppath                 /tmp/nagiosgrapher/
        serviceext_type         MULTIPLE
        serviceext_path         /etc/nagios/service-extinfo
        url                     /nagios/cgi-bin/graphs.cgi
        nagios_config           /etc/nagios/nagios.cfg
        cgi_config              /etc/nagios/cgi.cfg
        icon_image_tag          dot.gif' border="0"></a><A TARGET="_blank"
HREF="graphs.cgi?###URL###"><img src='###IMAGESRC###'
        icon_image_src                  /nagios/images/logos/graph.png
        log_file                /var/log/nagios/ngraph.log
        log_level               63
	}

define ngraph{
        service_name            PING
        graph_log_regex         loss = ([0-9]+)
        graph_value             Loss
        graph_units             %
        graph_legend            Packet Lost
        rrd_plottype            LINE2
        rrd_color               ff0000
        page                    Packet Lost
        }

define ngraph{
        service_name            PING
        graph_log_regex         rta = ([0-9]+\.+[0-9]*) ms
        graph_value             RTA
        graph_units             ms
        graph_legend            Time to answer
        rrd_plottype            AREA
        rrd_color               00A348
        page                    RTA
        }



-- 
	I hate duplicates. Just reply to the relevant mailinglist.
	hvdkooij at vanderkooij.org		http://hvdkooij.xs4all.nl/
		Don't meddle in the affairs of magicians,
		for they are subtle and quick to anger.


-------------------------------------------------------
This SF.Net email is sponsored by xPML, a groundbreaking scripting language
that extends applications into web and mobile media. Attend the live webcast
and join the prime developer group breaking into this new coding territory!
http://sel.as-us.falkag.net/sel?cmd=lnk&kid=110944&bid=241720&dat=121642
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null





More information about the Users mailing list