Difference between the timeout service and orphaned service

Yueh-Hung Liu yuehung.liu at gmail.com
Wed Jan 5 18:14:02 CET 2011


I think Nagios handles a check with the following flow:

[1]    schedule a check (into event queue)
[2]    time to go! fire the check (remove it from the queue)
[3]    wait for a result....
[4a]   check finished normally (got a result)
[4b]   timeout (terminate the check process)
[5]    go to [1] (reschedule)
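
As a rough illustration only, the flow above can be written as a toy
simulation. This is not Nagios code: CHECK_INTERVAL, CHECK_TIMEOUT and
run_cycle are hypothetical names, and rescheduling from the finish time
is a simplifying assumption rather than the way Nagios computes the
next check time.

```python
import heapq

CHECK_INTERVAL = 300   # hypothetical: seconds between two checks
CHECK_TIMEOUT = 60     # hypothetical: plays the role of the service check timeout


def run_cycle(queue, now, plugin_runtime):
    """Run steps [2]-[5] once for the check at the head of the queue.

    plugin_runtime is how long the simulated plugin takes, in seconds.
    Returns (outcome, time at which the check is due again).
    """
    due, name = heapq.heappop(queue)         # [2] fire: remove it from the queue
    started = max(now, due)
    if plugin_runtime <= CHECK_TIMEOUT:      # [3] wait for a result
        outcome = "result"                   # [4a] check finished normally
        finished = started + plugin_runtime
    else:
        outcome = "timeout"                  # [4b] terminate the check process
        finished = started + CHECK_TIMEOUT
    next_due = finished + CHECK_INTERVAL     # assumption: reschedule from finish time
    heapq.heappush(queue, (next_due, name))  # [5] reschedule, i.e. back to [1]
    return outcome, next_due


queue = []
heapq.heappush(queue, (0, "check_http"))     # [1] schedule a check
fast = run_cycle(queue, now=0, plugin_runtime=2)
slow = run_cycle(queue, now=fast[1], plugin_runtime=500)
print(fast, slow)
```

In both branches the check ends up back in the queue, which is the
point of the flow: a normal result and a timeout both lead to [5].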

An orphaned check can happen when Nagios is in [3] but never proceeds
to the next step.
It can't reach [4a] because the check process is dead (killed by someone).
It can't reach [4b] because there is nothing to terminate (the process is
gone, so Nagios has lost control of it).
As a result, Nagios never gets to [5], i.e. it won't reschedule and execute
the check again.

so, the answer to your question is yes.
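
To make the orphan case concrete, here is a minimal sketch of the idea,
under stated assumptions. The thread does not describe how Nagios detects
orphans internally, so the Check class, ORPHAN_THRESHOLD, fire,
find_orphans and reschedule are all hypothetical; the sketch only
mirrors the manual's wording ("removed from the event queue, but have
not had any results reported in a long time").

```python
from dataclasses import dataclass
from typing import List, Optional

ORPHAN_THRESHOLD = 600  # hypothetical: seconds without a result before we suspect an orphan


@dataclass
class Check:
    name: str
    in_queue: bool = True               # scheduled and waiting to fire
    started_at: Optional[float] = None  # set while a result is awaited


def fire(check: Check, now: float) -> None:
    """Step [2]: the check leaves the queue and we start waiting for a result."""
    check.in_queue = False
    check.started_at = now


def find_orphans(checks: List[Check], now: float) -> List[str]:
    """Names of checks that left the queue but reported nothing for too long."""
    return [
        c.name
        for c in checks
        if not c.in_queue
        and c.started_at is not None
        and now - c.started_at > ORPHAN_THRESHOLD
    ]


def reschedule(check: Check) -> None:
    """Step [5], forced: put the check back into the queue."""
    check.in_queue = True
    check.started_at = None


checks = [Check("check_http"), Check("check_disk")]
fire(checks[0], now=0)   # the plugin process is then killed by someone:
                         # no result arrives and there is nothing to time out
orphans = find_orphans(checks, now=1000)
for c in checks:
    if c.name in orphans:
        reschedule(c)
print(orphans)
```

Without the find_orphans pass, check_http would stay out of the queue
forever, which matches the "never goes to [5]" situation described above.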


2011/1/5 Yu Watanabe <yu.watanabe at jp.fujitsu.com>:
> Thank you for the reply.
>
> Could I ask you a further question?
>
> In the manual the orphaned services are defined as
>
> "This option allows you to enable or disable checks for orphaned service checks. Orphaned service
> checks are checks which have been executed and have been removed from the event queue, but have
> not had any results reported in a long time. Since no results have come back in for the service, it is not
> rescheduled in the event queue. This can cause service checks to stop being executed. Normally it is
> very rare for this to happen - it might happen if an external user or process killed off the process that
> was being used to execute a service check. "
>
> To be more specific, is this a state where, in memory, Nagios still
> believes the service check is running, but the plugin never returned
> anything because its process is already dead?
>
> Thank you for reading.
>
> Yu Watanabe
>
> Yueh-Hung Liu wrote:
>>They are not really related.
>>"service check timeout" tells Nagios when it should ACTIVELY kill a check process.
>>"orphaned service check" lets Nagios detect when a check
>>process has been killed BY OTHERS.
>>
>>
>>On Wed, Jan 5, 2011 at 2:05 PM, Yu Watanabe <yu.watanabe at jp.fujitsu.com> wrote:
>>> Hello all.
>>>
>>> I'd like to ask a question about the service check timeout and the orphaned service check.
>>> I have looked at the manual but couldn't work out the main difference.
>>>
>>> Could someone give me some advice on the purpose of these two options?
>>>
>>> Thanks
>>> Yu Watanabe
>>>
>>>
>>> ------------------------------------------------------------------------------
>>> Learn how Oracle Real Application Clusters (RAC) One Node allows customers
>>> to consolidate database storage, standardize their database environment, and,
>>> should the need arise, upgrade to a full multi-node Oracle RAC database
>>> without downtime or disruption
>>> http://p.sf.net/sfu/oracle-sfdevnl
>>> _______________________________________________
>>> Nagios-users mailing list
>>> Nagios-users at lists.sourceforge.net
>>> https://lists.sourceforge.net/lists/listinfo/nagios-users
>>> ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue.
>>> ::: Messages without supporting info will risk being sent to /dev/null
>>>
>>
>>
>
>
>





