<!DOCTYPE html PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN">
<html>
<head>
  <meta content="text/html;charset=ISO-8859-1" http-equiv="Content-Type">
</head>
<body bgcolor="#ffffff" text="#000000">
Hi Brian<br>
<br>
Whenever I changed the configuration I always used "killall -HUP
nagios".<br>
<br>
The machine does use the rc init scripts on bootup and shutdown. 
Several times I've killed the nagios process so that absolutely nothing of
Nagios was running, removed all the state information from nagios/var,
and then used the rc init script to fire it back up again.<br>
<br>
I understand Nagios spawns more processes; I just thought it strange
that it would spawn another daemon during a check.<br>
<br>
Anyway, last night we had some awful problems on that machine (files
not being found when they were actually there - probably explaining the
"127 - plugin not found" message.)<br>
<br>
I rebooted the machine last night and kicked Nagios off shortly
before midnight.  Out of my 17 services and 5 hosts, there are only two
services that are playing up now - the rest have been in the same
(correct) state for 15 and a half hours now.<br>
<br>
The 2 that are still playing up are NRPE checks which execute a
check_disk and a check_swap command on another machine.  The status logs
of the 2 services since midnight are shown below.<br>
<br>
I have no idea where "tcp" comes in, as it's not present in any of the
commands ("/usr/local/nagios/libexec/check_nrpe -H $ARG1$ -p
5666 -t 60 -c $ARG2$") - $ARG1$ is $HOSTADDRESS$ and $ARG2$ is
"check_endeavour" and "check_swap" for the 2 checks respectively.  The
NRPE on the remote machine for these two commands reads:<br>
<br>
command[check_endeavour]=/usr/local/nagios/libexec/check_disk -p
/endeavour -w 10% -c 5%<br>
command[check_swap]=/usr/local/nagios/libexec/check_swap -w 10% -c 5%<br>
<br>
Is it possible this machine is having difficulty reading the
/etc/protocols (or /etc/services) file, hence not being able to map a
protocol name to a number?<br>
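<br>
One way I could test that theory is to repeat the lookup by hand on the
Nagios machine. Below is a minimal sketch in Python, assuming the plugin's
message comes from a failed getprotobyname("tcp") call (which, as far as I
know, normally consults /etc/protocols rather than /etc/services). The
function name check_proto_lookup and the attempt count are my own
inventions for illustration, not anything from Nagios or NRPE.<br>

```python
import socket


def check_proto_lookup(name="tcp", attempts=100):
    """Repeat the protocol-name lookup and return how many attempts failed.

    Assumption: this mirrors the lookup the plugin performs; it does not
    call any Nagios or NRPE code.
    """
    failures = 0
    for _ in range(attempts):
        try:
            # Raises OSError when the name cannot be mapped to a number.
            socket.getprotobyname(name)
        except OSError:
            failures += 1
    return failures


if __name__ == "__main__":
    attempts = 100
    failed = check_proto_lookup("tcp", attempts)
    print("tcp lookup failures: %d out of %d" % (failed, attempts))
```

If this reports any failures for "tcp", the problem would seem to lie with
the machine's name lookups rather than with the check commands themselves.<br>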
<br>
### START STATUS LOG 2 ###<br>
<br>
<table class="logEntries" border="1" cellpadding="3" cellspacing="0">
  <tbody>
    <tr class="logEntriesOdd">
      <td class="logEntriesOdd">28-08-2006 00:00:00</td>
      <td class="logEntriesOdd">28-08-2006 05:50:49</td>
      <td class="logEntriesOdd">0d 5h 50m 49s</td>
      <td class="logEntriesOK">SERVICE OK (HARD)</td>
      <td class="logEntriesOdd">DISK OK - free space: /endeavour 135492
MB (94% inode=100%):</td>
    </tr>
    <tr class="logEntriesEven">
      <td class="logEntriesEven">28-08-2006 05:50:49</td>
      <td class="logEntriesEven">28-08-2006 05:56:32</td>
      <td class="logEntriesEven">0d 0h 5m 43s</td>
      <td class="logEntriesUNKNOWN">SERVICE UNKNOWN (HARD)</td>
      <td class="logEntriesEven">Cannot map "tcp" to protocol number</td>
    </tr>
    <tr class="logEntriesOdd">
      <td class="logEntriesOdd">28-08-2006 05:56:32</td>
      <td class="logEntriesOdd">28-08-2006 14:32:36</td>
      <td class="logEntriesOdd">0d 8h 36m 4s</td>
      <td class="logEntriesOK">SERVICE OK (HARD)</td>
      <td class="logEntriesOdd">DISK OK - free space: /endeavour 135492
MB (94% inode=100%):</td>
    </tr>
    <tr class="logEntriesEven">
      <td class="logEntriesEven">28-08-2006 14:32:36</td>
      <td class="logEntriesEven">28-08-2006 14:38:33</td>
      <td class="logEntriesEven">0d 0h 5m 57s</td>
      <td class="logEntriesUNKNOWN">SERVICE UNKNOWN (HARD)</td>
      <td class="logEntriesEven">Cannot map "tcp" to protocol number</td>
    </tr>
    <tr class="logEntriesOdd">
      <td class="logEntriesOdd">28-08-2006 14:38:33</td>
      <td class="logEntriesOdd">28-08-2006 14:55:34</td>
      <td class="logEntriesOdd">0d 0h 17m 1s+</td>
      <td class="logEntriesOK">SERVICE OK (HARD)</td>
      <td class="logEntriesOdd">DISK OK - free space: /endeavour 135482
MB (94% inode=100%):</td>
    </tr>
  </tbody>
</table>
<br>
### END STATUS LOG 2 ###<br>
<br>
### START STATUS LOG 2 ###<br>
<br>
<table class="logEntries" border="1" cellpadding="3" cellspacing="0">
  <tbody>
    <tr class="logEntriesOdd">
      <td class="logEntriesOdd">28-08-2006 00:00:00</td>
      <td class="logEntriesOdd">28-08-2006 05:50:40</td>
      <td class="logEntriesOdd">0d 5h 50m 40s</td>
      <td class="logEntriesOK">SERVICE OK (HARD)</td>
      <td class="logEntriesOdd">SWAP OK - 100% free (7828 MB out of
7828 MB)</td>
    </tr>
    <tr class="logEntriesEven">
      <td class="logEntriesEven">28-08-2006 05:50:40</td>
      <td class="logEntriesEven">28-08-2006 05:56:32</td>
      <td class="logEntriesEven">0d 0h 5m 52s</td>
      <td class="logEntriesUNKNOWN">SERVICE UNKNOWN (HARD)</td>
      <td class="logEntriesEven">Cannot map "tcp" to protocol number</td>
    </tr>
    <tr class="logEntriesOdd">
      <td class="logEntriesOdd">28-08-2006 05:56:32</td>
      <td class="logEntriesOdd">28-08-2006 15:00:36</td>
      <td class="logEntriesOdd">0d 9h 4m 4s+</td>
      <td class="logEntriesOK">SERVICE OK (HARD)</td>
      <td class="logEntriesOdd">SWAP OK - 100% free (7828 MB out of
7828 MB)</td>
    </tr>
  </tbody>
</table>
<br>
### END STATUS LOG 2 ###<br>
<br>
<br>
Thanks<br>
<br>
Andy.<br>
</body>
</html>