Service hard state generation and host hard or soft down status

Paul Ezvan paul at ezvan.fr
Fri May 4 12:16:34 CEST 2012


Hi dear Nagios users,

I have some interrogation about hard state generation.

According to the documentation, one of the condition to create a hard 
non-ok state for a service is to get a check in a non-ok state while the 
associated host is down. But it is not stated if the host should be down 
HARD or not.

The current behavior of Nagios is clearly ignoring if the host is in 
SOFT or HARD down state, for example :

[1336039429] INITIAL HOST STATE: ces;UP;HARD;1;PING OK - Packet loss = 
0%, RTA = 0.42 ms
[1336039429] INITIAL SERVICE STATE: 
ces;SV-SE-Linux-Memoire;OK;HARD;1;OK: Memory Usage (W> 95): 12%Swap 
Usage (W> 95, C> 99): 0%
[1336039429] INITIAL SERVICE STATE: ces;SV-SE-Linux-SWAP;OK;HARD;1;SWAP 
OK - 100% free (3999 MB out of 3999 MB)
[1336039747] HOST ALERT: ces;DOWN;SOFT;1;CRITICAL - Host Unreachable 
(10.235.72.159)
[1336039812] HOST ALERT: ces;DOWN;SOFT;2;CRITICAL - Host Unreachable 
(10.235.72.159)
[1336039822] SERVICE ALERT: 
ces;SV-SE-Linux-SWAP;CRITICAL;HARD;1;Connection refused or timed out
[1336039822] SERVICE ALERT: 
ces;SV-SE-Linux-Memoire;CRITICAL;HARD;1;Connection refused or timed out
[1336039877] HOST ALERT: ces;DOWN;HARD;3;CRITICAL - Host Unreachable 
(10.235.72.159)
[1336040122] SERVICE ALERT: 
ces;SV-SE-Linux-Memoire;CRITICAL;HARD;1;Connection refused or timed out
[1336040122] SERVICE ALERT: 
ces;SV-SE-Linux-SWAP;CRITICAL;HARD;1;Connection refused or timed out

The associated service immediately get an HARD non-ok state even if the 
host is in a SOFT down state.

In the Nagios code I found in base/checks.c in non-ok state processing 
logic :

		/* if the host is down or unreachable ... */
		/* 05/29/2007 NOTE: The host might be in a SOFT problem state due to 
host check retries/caching.  Not sure if we should take that into 
account and do something different or not... */
		if(route_result != HOST_UP) {

I think we should take into account the SOFT or HARD host state to 
ensure consistency between host and service hard/soft state.

Is my analysis correct ?

What is your point of view about the above proposition ?

Thanks for reading,

cheers.

Paul Ezvan

------------------------------------------------------------------------------
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and 
threat landscape has changed and how IT managers can respond. Discussions 
will include endpoint security, mobile security and the latest in malware 
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null





More information about the Users mailing list