Nagios just stopped running

Rimbert Rivera rrivera at comtex.com
Wed Jan 5 22:29:49 CET 2005


It looks like we are using performance data, but I was able to rename
the current one and nagios created a new one and continues working
without having to restart nagios.  I did the same with
host-perfdata.out.  We'll add this to our periodic maintenance
procedure.



- Rim

Rimbert Rivera
Manager, Information Technology
COMTEX News Network
rrivera at comtex.com
(703) 820-2000
Discover more about COMTEX at: http://www.comtex.com/
This e-mail is intended solely for the person or entity to which it is
addressed and may contain confidential and/or privileged information.
Any review, dissemination, copying, printing or other use of this e-mail
by persons or entities other than the addressee is prohibited. If you
have received this e-mail in error, please contact the sender
immediately and delete the material from any computer.

-----Original Message-----
From: Stephan Janosch [mailto:stephan.janosch at interface-business.de] 
Sent: Wednesday, January 05, 2005 3:45 AM
To: Rimbert Rivera
Cc: nagios-users at lists.sourceforge.net
Subject: Re: [Nagios-users] Nagios just stopped running

Rimbert Rivera wrote:
> I have a cron job that runs the check_nagios plugin and e-mails us the

> output.  Earlier today, we started getting:
> "Nagios problem: located 3 processes, status log updated 1565 seconds
ago"
> 
> Everytime it ran, it was the same output with a longer time that it 
> wasn't updated.  This was working fine before where the status log 
> would be updated usually no longer than 8 seconds ago.  I checked the 
> status.log and status.sav and confirmed that they hadn't updated.  I 
> restarted nagios but I still had the same problem.  Even though none 
> of the partitions were running out of space, I deleted archived logs 
> and restarted nagios but same problem.  I did some more 
> troubleshooting without any luck.  Long story short, I rebooted the 
> RH9 box it was running on and nagios started running again.
> 
> Anyone have an idea of what could've happened and things I could
check?  
> This is the first time this has ever happened as far as I know.  The 
> recent changes we made were just setting up one new host to monitor so

> we edited hosts.cfg, hostgroups.cfg and services.cfg but nagios 
> restarted without error.  I even took out those changes and restarted 
> nagios but still had the same problem.  One thing I noticed was our 
> service-perfdata.out is 790 MB.  Can I delete this and nagios will 
> create a new one?  I'm not sure it's the problem since it's still that

> big and nagios is running now but it doesn't seem like I want that 
> file to get that big.

To your service-perfdata.out. If you don't need any performance data,
you can switch perfmormance data logging of. Depending on your
performance command definition, you can simply delete it. I don't know,
if you have changed that command. Look into misccomands.cfg, there it is
located by standard.



> 
> What kind of maintenance should I be performing on nagios?  We've had 
> it running for over a year and we haven't really did any kind of
cleanup on it.
> 
> Your help to this newbie would be greatly appreciated.


Stephan



-------------------------------------------------------
The SF.Net email is sponsored by: Beat the post-holiday blues
Get a FREE limited edition SourceForge.net t-shirt from ThinkGeek.
It's fun and FREE -- well, almost....http://www.thinkgeek.com/sfshirt
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null





More information about the Users mailing list