softstates and retry check intervals

Tharanga tharanga at roomsnet.com
Thu Jan 5 04:02:41 CET 2006


here is the status.log

[1136429754] SERVICE;DB-server;cpu
load;OK;1/30;HARD;1136429719;1136429744;ACTIVE;1;1;1;1136414173;0;OK;4877981
;0;50829;42026;0;0;1;8;2;1;0;0.00;0;1;1;1;OK - load average: 0.74, 0.86,
0.88
[1136429754]
SERVICE;-DB-server;ping;OK;1/10;HARD;1136429739;1136429789;ACTIVE;1;1;1;1136
412427;0;OK;4969226;0;0;46;0;0;1;0;1;1;0;0.00;0;1;1;1;PING OK - Packet loss
= 0%, RTA = 0.06 ms
[1136429754] SERVICE;Current
Users;OK;1/30;HARD;1136429743;1136429768;ACTIVE;1;1;1;1136377184;0;OK;429467
;0;12388;17;0;0;0;0;2;1;0;0.00;0;1;1;1;USERS OK - 1 users currently logged
in

---THARANGA--


----- Original Message ----- 
From: "Danny Russell" <drussell at mpifix.com>
To: "Tharanga" <tharanga at roomsnet.com>; <nagios-users at lists.sourceforge.net>
Sent: Thursday, January 05, 2006 8:45 AM
Subject: RE: [Nagios-users] softstates and retry check intervals


Email us the status.log that contains the checks of the service.

-----Original Message-----
From: Tharanga [mailto:tharanga at roomsnet.com]
Sent: Wednesday, January 04, 2006 7:29 PM
To: Danny Russell; nagios-users at lists.sourceforge.net
Subject: Re: [Nagios-users] softstates and retry check intervals

Thxs again...for the mail...but thats not working..iam using version
1.2.
can it be a problem ??
it alerts every time when nrpe time out occurs. it wont check 3
attempts..in
the first failure it gives the crtical state..and alert..this is a real
problem to my administrators.

thanks for ur time..
rgds
Tharanga



----- Original Message ----- 
From: "Danny Russell" <drussell at mpifix.com>
To: "Tharanga" <tharanga at roomsnet.com>;
<nagios-users at lists.sourceforge.net>
Sent: Thursday, January 05, 2006 8:18 AM
Subject: RE: [Nagios-users] softstates and retry check intervals


That should work, here is a sample from my config:

define service {
        name --template Customer Service
        max_check_attempts 5
        normal_check_interval 10
        retry_check_interval 3
        active_checks_enabled 1
        check_period 24x7
        process_perf_data 1
        retain_status_information 1
        retain_nonstatus_information 1
        notification_interval 30
        notification_period ShopHours
        notifications_enabled 1
        register 0
        notification_options u,c,r
        servicegroups Customer Services
        contact_groups FieldOpsSupport,TechOpsSupport
}

-----Original Message-----
From: Tharanga [mailto:tharanga at roomsnet.com]
Sent: Wednesday, January 04, 2006 7:03 PM
To: Danny Russell; nagios-users at lists.sourceforge.net
Subject: Re: [Nagios-users] softstates and retry check intervals

Hello Danny,

Thxs for the immeidate reply. actualy i need to do this
"In order to prevent false alarms, Nagios allows you to define how many
times a service or host check will be retried before the service or host
is
considered to have a real problem."

but my one process went to NRPE time out..it gives critical alert..but
its a
false alarm. if nagios can check that service 3 times..if all three
times
fails..it should alert...
this is my servises.cfg


define service{
       use                             generic-service         ; Name of
service template to use

       host_name                       Linux-PBX
       service_description             PBX-Asterisk process
       is_volatile                     0
       check_period                    24x7
       max_check_attempts              3
       normal_check_interval           1
       retry_check_interval            1
       contact_groups                  linux-admins
       notification_interval           240
       notification_period             24x7
       notification_options            w,u,c,r
       check_command                   check_snmp_process!asterisk
       }

thxs in advance
Tharanga



----- Original Message ----- 
From: "Danny Russell" <drussell at mpifix.com>
To: "Tharanga" <tharanga at roomsnet.com>;
<nagios-users at lists.sourceforge.net>
Sent: Thursday, January 05, 2006 7:46 AM
Subject: RE: [Nagios-users] softstates and retry check intervals


Notifications are only sent out for HARD States. The retry check
interval is how quickly should I start checking as soon as this service
is in an non-okay status. The maximum check attempts is how many
consecutive checks must be the same before it is considered a hard
state, up until then it was a soft state.

This can probably explain it better:
http://nagios.sourceforge.net/docs/2_0/statetypes.html



-----Original Message-----
From: nagios-users-admin at lists.sourceforge.net
[mailto:nagios-users-admin at lists.sourceforge.net] On Behalf Of Tharanga
Sent: Wednesday, January 04, 2006 6:38 PM
To: nagios-users at lists.sourceforge.net
Subject: [Nagios-users] softstates and retry check intervals

Greetings,

My nagios system alerts as critical when the system gives CHECK_NRPE
time
out. so..i need to retry 3 attempts using retry intervals..then after
this 3
checks i want to send the critical alert. but theres a method called
softstates in nagios.

can any one help me to configure this..
thxs in advance
Tharanga




-------------------------------------------------------
This SF.net email is sponsored by: Splunk Inc. Do you grep through log
files
for problems?  Stop!  Download the new AJAX search engine that makes
searching your log files as easy as surfing the  web.  DOWNLOAD SPLUNK!
http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when
reporting any issue.
::: Messages without supporting info will risk being sent to /dev/null






-------------------------------------------------------
This SF.net email is sponsored by: Splunk Inc. Do you grep through log files
for problems?  Stop!  Download the new AJAX search engine that makes
searching your log files as easy as surfing the  web.  DOWNLOAD SPLUNK!
http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null





More information about the Users mailing list