Losing Servers/Services

Carl Schelin cschelin at hq.nasa.gov
Mon Jul 21 21:28:11 CEST 2003


Server: Solaris 7, Apache 1.3.20, Ben-SSL/1.44, Nagios 1.1, Plug-ins
1.3.1

Workstation1: Solaris 7, Netscape 4.77
Workstation2: Mandrake 9.1, Mozilla 1.3.1
Workstation3: Windows 200, IE 6.0.2800.1106

Problem: The number of hosts and services change with Service Detail and
Host Detail refreshes. Hosts/Services I've removed are still ghosting
around.

First I added the print queues (etc/printers.cfg) after whipping up a
smbclient shell script. The number of services in Service Detail and
Host Detail changed from 609 to 139 and it appeard that the printers
weren't being monitored any more.

After a bunch of warnings were e-mailed to me so I set
active_checks_enabled=0.

Second I had 13 or so hosts that were not production systems so didn't
need to be monitored. I created a new template; generic-service-down and
set active_checks_enabled=0. I changed the use line in each of the down
servers.

Third, when I got in this morning there were some 5,000 e-mails waiting
for me. About 4,500 of them were status checks from the printers and the
disabled servers. I removed printers.cfg from nagios.cfg.

Fourth I added a new server this morning and just had a PING service. I
also removed the 13 disabled hosts from hostgroups, hosts and services.
I commented out all of the printers but one and added printers.cfg back
in to nagios.cfg.

When I bring up the 3d status map, the new server is grayed out and says
no information is available. Clicking on the grayed out server will
refresh the information on 3d status map.

The status.sav file had the new server in it but not in status.log.
After a few minutes, it rolled out of the status.sav file.

If I refresh the Service Detail or Host Detail page, the new server will
show up but after the auto-refresh, the number of hosts/services dropps
from 115/119 to 114/117. In a few minutes it drops to 112/115.

I've flushed the disk/memory cache from my browsers and just like the
refresh, the first page is ok but following refreshes will drop the
number of servers/services down.

I'm getting down in the dirt on this and figure this is about the time
to post a query.

Thanks in advance.

Carl



-------------------------------------------------------
This SF.net email is sponsored by: VM Ware
With VMware you can run multiple operating systems on a single machine.
WITHOUT REBOOTING! Mix Linux / Windows / Novell virtual machines at the
same time. Free trial click here: http://www.vmware.com/wl/offer/345/0
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null





More information about the Users mailing list