Nagios MTBF MTTR

Badri Pillai badri at diglinks.com
Thu May 20 09:21:39 CEST 2010


Hi,

Well, the information from avail.cgi is not fully accurate, but it is useful.

BUT the customer would like to have more info:
- what the problem was,
  e.g. service HTTP in CRITICAL state could be a system, application,
  network... problem

- who solved it (maybe he/she gets a pay raise based on the info 8-)

- how it was solved, so that similar problems can be handled
  faster/better (knowledge base)

PS: I would let Nagios do the monitoring and alerting, which it does quite
well compared to some very expensive monitoring tools!!

Regards,

bp

-----Original Message-----
From: Martin Melin [mailto:mmelin at gmail.com] 
Sent: Wednesday, May 19, 2010 20:44
To: Nagios Developers List
Subject: Re: [Nagios-devel] Nagios MTBF MTTR

On Wed, May 19, 2010 at 7:41 PM, Christian Iñiguez
<challenger_joseph at yahoo.com.mx> wrote:
> Hi Everyone!
>
> I am currently using Nagios 3.2.0, and it has been very useful for us, but
> recently I was told that we need to implement the measures MTBF (Mean Time
> Between Failures) and MTTR (Mean Time To Repair), and I do not know how.
>
> Could you tell me whether there is any tool, script, or report in Nagios
> (or based on it) to get these measures? Has anybody implemented this in
> Nagios?
>
> Any help would be very useful to me. I hope you guys can help me.
>
> Thanks in advance!!

I don't know of a released script that does this, but all the data is
available from core Nagios. MTBF is just (total time - total
downtime) / number of failures, i.e. total uptime / number of failures,
and as long as you don't have any definition of a fault being repaired
other than "Nagios records recovery", MTTR is simply a matter of
averaging the time between hard DOWN and UP state changes.
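
A minimal sketch of that arithmetic, assuming the hard state changes for one
host have already been extracted as (timestamp, state) pairs sorted by time.
The function name mtbf_mttr and the input layout are hypothetical, not part of
Nagios. MTBF is taken here as total uptime divided by the number of failures,
and the host is assumed to be UP at the start of the reporting window.

```python
def mtbf_mttr(events, start, end):
    """Return (mtbf, mttr) in seconds for one host, or None where undefined.

    events: list of (unix_timestamp, state) hard state changes, sorted by
            time, with state "UP" or "DOWN" (hypothetical layout).
    start, end: bounds of the reporting window; host assumed UP at start.
    """
    downtime = 0          # seconds spent in completed outages
    failures = 0          # number of UP -> DOWN transitions
    repairs = 0           # number of DOWN -> UP transitions
    down_since = None     # timestamp of the outage in progress, if any

    for ts, state in events:
        if state == "DOWN" and down_since is None:
            down_since = ts
            failures += 1
        elif state == "UP" and down_since is not None:
            downtime += ts - down_since
            repairs += 1
            down_since = None

    # Only completed outages count toward MTTR.
    repaired_downtime = downtime
    if down_since is not None:
        # Still down at the end of the window: counts as downtime only.
        downtime += end - down_since

    uptime = (end - start) - downtime
    mtbf = uptime / failures if failures else None
    mttr = repaired_downtime / repairs if repairs else None
    return mtbf, mttr


# Two outages of 60 s and 120 s inside a 1000 s window.
example = [(100, "DOWN"), (160, "UP"), (500, "DOWN"), (620, "UP")]
print(mtbf_mttr(example, 0, 1000))
```

An outage that is still open at the end of the window reduces uptime but is
left out of the MTTR average, since no recovery has been recorded for it yet.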

For MTBF the avail.cgi output along with a count of recoveries should
IMHO get you a long way to this goal. MTTR requires some more logic
but should be relatively simple.
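
As a rough illustration of the extra logic MTTR needs, the hard DOWN and UP
changes can be pulled out of the Nagios log before any averaging. The sketch
below assumes HOST ALERT lines of the form
"[timestamp] HOST ALERT: host;STATE;STATETYPE;attempt;output", as written by
a default Nagios 3.x setup; please check this against your own nagios.log.
The function name hard_host_events is hypothetical.

```python
import re

# Assumed log line layout, e.g.:
# [1274253699] HOST ALERT: web01;DOWN;HARD;3;CRITICAL - Host Unreachable
HOST_ALERT = re.compile(
    r"^\[(\d+)\] HOST ALERT: ([^;]+);(UP|DOWN|UNREACHABLE);(SOFT|HARD);(\d+);(.*)$"
)


def hard_host_events(lines, host):
    """Return a list of (timestamp, state) for HARD UP/DOWN changes of host."""
    events = []
    for line in lines:
        m = HOST_ALERT.match(line.strip())
        if not m:
            continue  # not a host alert (service alerts, notifications, ...)
        ts, name, state, state_type = m.group(1, 2, 3, 4)
        # Soft states are retries in progress; only hard changes count.
        if name == host and state_type == "HARD" and state in ("UP", "DOWN"):
            events.append((int(ts), state))
    return events
```

UNREACHABLE is skipped here on purpose, since it usually points at a parent
host rather than the host itself; whether that is right depends on how you
define a failure.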

If there's sufficient interest I could probably try hacking up a draft
of this, but I'd like to see if anyone else on the list has a better
idea or working code first.

Best regards,
Martin Melin

------------------------------------------------------------------------------

_______________________________________________
Nagios-devel mailing list
Nagios-devel at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-devel


