Separate mail server problems cause Nagios to plotz (or vice versa?)

Terry Carmen terry at cnysupport.com
Fri Jun 24 19:42:47 CEST 2011


Quoting up at 3.am:

> We have Nagios monitoring a variety of services on roughly 50
> separate servers.  Several of them are mail servers, but only the
> "main" one (that contains most of the Nagios notification
> recipients) has this problem.
>
> The mail server will start to become unresponsive to just about any
> input (but pings fine).

This is a mail server issue. You would need to determine exactly what
process(es) have become unresponsive and why.
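
As a rough way to narrow that down, a sketch like the one below could be run
from another machine while the problem is happening. It tries a plain TCP
connect to each service on the mail server and reports which ones answer and
how long they take. The host name and the port list are placeholders I made
up, not details from the original report. Note that a successful connect only
shows the port is accepting connections; a hung daemon can still accept, so
treat this as a first pass rather than a diagnosis.

```python
import socket
import time

# Hypothetical host and ports; substitute the real mail server and services.
MAIL_HOST = "mail.example.com"
SERVICES = {"smtp": 25, "ldap": 389, "imap": 143}


def probe(host, port, timeout=5.0):
    """Try a TCP connect and return (ok, seconds_elapsed)."""
    start = time.monotonic()
    try:
        with socket.create_connection((host, port), timeout=timeout):
            ok = True
    except OSError:
        # Covers refused connections, timeouts, and name lookup failures.
        ok = False
    return ok, time.monotonic() - start


def probe_all(host, services, timeout=5.0):
    """Probe every named service and return {name: (ok, seconds_elapsed)}."""
    return {name: probe(host, port, timeout) for name, port in services.items()}


if __name__ == "__main__":
    for name, (ok, secs) in probe_all(MAIL_HOST, SERVICES).items():
        status = "up" if ok else "NO ANSWER"
        print(f"{name:5s} {status} after {secs:.2f}s")
```

Running it once while the server is healthy and again during an incident
gives two sets of timings to compare.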

> Simultaneously, Nagios, which is on a separate server, will send out
> notifications that every service on every server is down because
> Nagios cannot reach them.


Why can't it reach them? Is your mail server also your router?
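
One place to start looking, from the Nagios host itself: if that host
resolves names through something that lives on the mail server (this is only
a guess on my part, nothing in the report confirms it), then lookups would
stall whenever the mail server does and every check would appear to fail at
once. The sketch below just times a name lookup so a slow or failing resolver
stands out. The function name is made up for illustration.

```python
import socket
import time


def time_lookup(hostname):
    """Resolve a host name and return (ok, seconds_elapsed)."""
    start = time.monotonic()
    try:
        socket.getaddrinfo(hostname, None)
        ok = True
    except socket.gaierror:
        # The resolver answered with a failure, or could not be reached.
        ok = False
    return ok, time.monotonic() - start


if __name__ == "__main__":
    # Replace with the names of a few monitored hosts.
    for name in ["localhost"]:
        ok, secs = time_lookup(name)
        print(f"{name}: {'resolved' if ok else 'FAILED'} in {secs:.3f}s")
```

If lookups are quick during an incident, name resolution is probably not the
shared dependency and the network path is the next thing to check.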

Terry

> Since almost all of them go through this problem mail server,
> including those that forward to text messaging services, they will
> stop and resume again when the mail server is either rebooted, or
> otherwise is brought back to life...sometimes by restarting the LDAP
> server process on it.
>
> There are perhaps a few dozen total email destinations for
> notifications.  Even multiplying this times the total number of
> services that Nagios monitors, it doesn't seem likely that just the
> volume of emails generated by Nagios would cause all this.  It is a
> fairly modern, multiprocessor server (CentOS/Sendmail).
>
> Can anyone offer any insight or similar experiences?
>
> Thanks in Advance!
>
>
> ------------------------------------------------------------------------------
> All the data continuously generated in your IT infrastructure contains a
> definitive record of customers, application performance, security
> threats, fraudulent activity and more. Splunk takes this data and makes
> sense of it. Business sense. IT sense. Common sense..
> http://p.sf.net/sfu/splunk-d2d-c1
> _______________________________________________
> Nagios-users mailing list
> Nagios-users at lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/nagios-users
> ::: Please include Nagios version, plugin version (-v) and OS when
> reporting any issue.
> ::: Messages without supporting info will risk being sent to /dev/null
>

 

-- 
Terry Carmen
CNY Support, LLC
Web. Database. Business.
http://www.cnysupport.com





