Continued downtime

Carl Schelin cschelin at hq.nasa.gov
Mon Mar 1 19:59:12 CET 2004


Greetings,

   Due to a power outage over the weekend, I had the admins schedule
downtime for about 50 systems that'd be affected.

   They missed three systems and they started paging out when they went
down. The admin on site configured them for downtime:15 hours.

   One of the systems never came back from downtime.

   I can see it being activated in the logs but there's no corresponding
activation the following day.

   At the time it should have come off of downtime, there were lots of
pages for an application on a different server that didn't get
restarted. It paged out all night.

   Today, while checking everything, I noticed several processes in
memory however none were associated with the system.

   I stopped nagios, waited for the processes to die down and restarted.
Once I did that, the system "came back up".

   With that, I have two questions here:

   1. Is there a way to force a system to come off of downtime?
   2. Is there something I could have done differently to ensure the
host becomes available after downtime?

   Nagios 1.2
   Nagios-Plugins 1.3.1
   Solaris 7

   I did search the FAQ, Google Groups/Web and the mail list.

   Thanks,

Carl



-------------------------------------------------------
SF.Net is sponsored by: Speed Start Your Linux Apps Now.
Build and deploy apps & Web services for Linux with
a free DVD software kit from IBM. Click Now!
http://ads.osdn.com/?ad_id=1356&alloc_id=3438&op=click
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null





More information about the Users mailing list