Distributed nagios problem - service definition not found!

TIM MOORE MOORET10 at odjfs.state.oh.us
Wed Oct 6 21:06:47 CEST 2004


I recently set up distributed Nagios, following the directions very closely.  My first problem was running the nsca daemon through xinetd: it would not listen for incoming connections on port 5667, even after I added the nsca line to /etc/services.  Here is my xinetd config:
 
service nsca
{
        flags           = REUSE
        socket_type     = stream
        wait            = no
        user            = nagios
        group           = nagios
        server          = /usr/local/nagios/bin/nsca
        server_args     = -c /usr/local/nagios/etc/nsca.cfg
        log_on_failure  += USERID
        disable         = no
        only_from       = 10.12.225.50
}

If I run it from the command line in daemon mode, it works fine.
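
One way to tell "nothing is listening on 5667" apart from "something is listening but not behaving" is a plain TCP connect test. The following is only a minimal sketch, assuming Python is available on the machine running the test; the function name port_is_open is hypothetical. A successful connect shows only that something accepts TCP connections on the port, not that nsca is handling them correctly.

```python
import socket


def port_is_open(host, port, timeout=3.0):
    """Return True if a TCP connection to (host, port) can be established."""
    try:
        # create_connection resolves the host and performs the TCP handshake.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers connection refused, unreachable host, and timeouts.
        return False
```

For example, port_is_open("noc-mon", 5667) could be run from the distributed server, where "noc-mon" is the central server's name as it appears in the syslog lines and may need to be replaced by its address.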
 
My main problem is that when passive checks are sent to the central server, I keep getting the warning on the last line of this log:
 
Oct  6 15:02:28 noc-mon nsca[31620]: Connection from 10.12.225.50 port 38784
Oct  6 15:02:28 noc-mon nsca[31620]: Host address checks out ok
Oct  6 15:02:28 noc-mon nsca[31620]: Handling the connection...
Oct  6 15:02:29 noc-mon nsca[31620]: SERVICE CHECK -> Host Name: 'localhost', Service Description: 'cpu', Return Code: '0', Output: 'OK - load average: 0.00, 0.00, 0.00'
Oct  6 15:02:29 noc-mon nsca[31620]: End of connection...
Oct  6 15:02:30 noc-mon nagios: EXTERNAL COMMAND: PROCESS_SERVICE_CHECK_RESULT;localhost;cpu;0;OK - load average: 0.00, 0.00, 0.00
Oct  6 15:02:39 noc-mon nsca[31817]: Connection from 10.12.225.50 port 39040
Oct  6 15:02:39 noc-mon nsca[31817]: Host address checks out ok
Oct  6 15:02:39 noc-mon nsca[31817]: Handling the connection...
Oct  6 15:02:40 noc-mon nsca[31817]: SERVICE CHECK -> Host Name: 'acdmz-inside-sw2', Service Description: 'Check Host Alive', Return Code: '0', Output: 'PING OK - Packet loss = 0%, RTA = 0.83 ms'
Oct  6 15:02:40 noc-mon nsca[31817]: End of connection...
Oct  6 15:02:40 noc-mon nagios: EXTERNAL COMMAND: PROCESS_SERVICE_CHECK_RESULT;acdmz-inside-sw2;Check Host Alive;0;PING OK - Packet loss = 0%, RTA = 0.83 ms
Oct  6 15:02:44 noc-mon nagios: Warning:  Message queue contained results for service 'Check Host Alive' on host 'acdmz-inside-sw2'.  The service could not be found!
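
The warning names a host and a service description, so those two strings are presumably what the central server tries to look up in its own configuration. To compare them character for character against the definitions on the central server, they can be pulled out of the logged external command. This is a minimal sketch, assuming the semicolon-separated layout seen in the log lines above (host;service;return code;output); the function name parse_passive_result is hypothetical.

```python
def parse_passive_result(log_line):
    """Extract the fields of a logged PROCESS_SERVICE_CHECK_RESULT command.

    Returns a dict, or None if the line does not contain such a command.
    """
    marker = "PROCESS_SERVICE_CHECK_RESULT;"
    start = log_line.find(marker)
    if start == -1:
        return None
    # Split into at most four fields so the plugin output stays in one piece.
    fields = log_line[start + len(marker):].split(";", 3)
    if len(fields) != 4:
        return None
    host, service, code, output = fields
    return {
        "host": host,
        "service": service,
        "return_code": int(code),
        "output": output,
    }
```

Feeding it the 15:02:40 "EXTERNAL COMMAND" line gives host 'acdmz-inside-sw2' and service 'Check Host Alive', which are the exact strings to search for in the central server's host and service definitions.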

The localhost check appears to work, but the result for the simple 'Check Host Alive' service is rejected.  I know that service definition exists on both servers, and both are running v1.2.  Also, should I see something on my central server's web GUI showing these hosts as down?  My host count has not changed at all since I added hosts to the distributed server.  Am I missing something?  Is there something wrong with the default check-host-alive service check?
 
Thanks for any help,
  
 
--------------------------------------
Tim Moore
DNS/Linux/Cisco Admin
ODJFS
