Log Rotations Issue

Alaric paxindustria at gmail.com
Thu Jan 31 17:53:18 CET 2013


On Jan 31, 2013, at 11:32 AM, Assaf Flatto <nagios at flatto.net> wrote:

> On 31/01/13 15:56, Alaric wrote:
>> Hi,
>> 
>> I was hoping that someone on this list might have some insight to an issue that I recently ran into after upgrading my Nagios core installation to 3.4.4 (out of the EPEL Repo) After upgrading, log rotation stopped on one of my two servers, and perfdata_file_processing_commands stopped working, and whatever Job that sets 'CURRENT HOST STATE' in the main nagios logs stopped working.
>> 
>> I upgraded both my Dev server, and my Production server, and only my Prod servers seems to have the issue.  Both run the same code, and both have the same configs.
>> I'm confident the configs as the same, as both get the configs deployed via puppet.  I've googled around quite a bit, and haven't had any luck figuring it out.  Has anyone seen anything similar?
>>   My feeling, based on the behavior of my Dev server is that it's not a problem with the code, but that something got "stuck"  but I'm darned if I can figure out what. I've cleared out the logs, restarted nagios, rebooted the server. Audited configs and checked the debug output.
>> 
>> Any help is much appreciated!
>> 
>> 
>> -a
>> 
>> 
>> 
>> 
> What is the difference in the volume of activity on those servers , it 
> could be that you found an  issue related the the amount of checks or 
> traffic generated to the the log.
> 
> If the internal log rotation is faulty - have you considered using 
> logrotate as a dirty hack to fix your issue ?
> 

While logrotate or a cronjob will clean up the actual files, part of what I'm trying to troubleshoot what looks like the failure of some internal nagios processes, for example, some processes nightly normally adds these enteries into the top of the log:

[1359608400] CURRENT HOST STATE: example1;UP;HARD;1;FPING OK - 10.1.2.3 (loss=0%, rta=1.210000 ms)

Which goes missing, even if I manually rotate the logs...  




The difference in volume seems pretty low i've been trying to keep dev and prod as similar as possible:


Host Service Checks from Dev:
# Active Host / Service Checks:	1486 / 7219
# Passive Host / Service Checks:	0 / 0

Host Service Checks form Prod:
# Active Host / Service Checks:	1564 / 8264
# Passive Host / Service Checks:	0 / 84



------------------------------------------------------------------------------
Everyone hates slow websites. So do we.
Make your web apps faster with AppDynamics
Download AppDynamics Lite for free today:
http://p.sf.net/sfu/appdyn_d2d_jan
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null





More information about the Users mailing list