RFC 1 Check_Open_Connections 2 SLA attribute for templates.

Stanley Hopcroft Stanley.Hopcroft at IPAustralia.Gov.AU
Wed Jul 23 04:40:17 CEST 2003


Dear Ladies and Gentlemen,

I am writing to request your comments on 

1 A plugin to check that the number of open TCP connections is less than 
a threshold.

This plugin is inspired by the Orcallator monitor of 'Number of Open TCP 
connections' and made essential by my employers famous name application 
server (name withheld since this is a family program) dieing early one 
morning when the hosts limit was breached.

It is

. implemented in Perl

. not portable (basically its a Netstat filter that needs to know the 
Netstat format of the host)

. easy to hack because it is so-so thin

Here is the sample output (Sol version).

[asterix]/usr/home/nagios$ perl/check_open_connections
Ok. Number of open TCP connections (362) less than threshold. Top 
connectors: 10.0.100.201  108, 10.0.100.65  49, 192.168.101.210  27, 
192.168.102.210  24, 192.168.103.210  22.

[asterix]/usr/home/nagios$ perl/check_open_connections -h
check_open_connections (netsaint-plugins 1.2.9-4) 1.3
The netsaint plugins come with ABSOLUTELY NO WARRANTY. You may 
redistribute
copies of the plugins under the terms of the GNU General Public License.
For more information about these matters, see the file named COPYING.
Copyright (c) 2003 Stanley Hopcroft

Perl check of open TCP connections

Returns OK if the number of open TCP Connections (ie those in the 
etsablished state) is less than or equal to MAX_OPEN_CONNECTIONS

Usage: check_open_connections

-V, --version
   Plugin version.
-d, --debug
   Debugging output.
-h, --help
   This stuff.

Send email to netsaint-users at lists.sourceforge.net if you have questions
regarding use of this software. To submit patches or suggest 
improvements,
send email to netsaintplug-devel at lists.sourceforge.net

[asterix]/usr/home/nagios$ uname -a
SunOS asterix 5.8 Generic_108528-20 sun4u sparc 

2 A new SLA attribute to the template configuration.

The SLA attribute would have values like those of the check period or 
notification period (eg 24x7, workhours etc) or plain percentages (eg 
<grin>99.99</grin>)

It is for add-on projects to be able to compute SLA conformance by 
comparing the Nag measured availability with that required by the SLA.

Computing SLA conformance then becomes 

1 either compare the OK% (from the Nag avail.cgi) with the percentage 
value Or

2 the much more challenging one (that is probably optional or best left 
for later) for SLAs phrased like 'workhours', 

Modify avail.cgi to check if outage intervals lie inside an SLA period 
and compute the SLA appropriately.

In the latter case, the SLA attribute may have to contain more data like
'99.5% of workhours'.

Yours sincerely.


-- 
------------------------------------------------------------------------
Stanley Hopcroft
------------------------------------------------------------------------

'...No man is an island, entire of itself; every man is a piece of the
continent, a part of the main. If a clod be washed away by the sea,
Europe is the less, as well as if a promontory were, as well as if a
manor of thy friend's or of thine own were. Any man's death diminishes
me, because I am involved in mankind; and therefore never send to know
for whom the bell tolls; it tolls for thee...'

from Meditation 17, J Donne.


-------------------------------------------------------
This SF.net email is sponsored by: VM Ware
With VMware you can run multiple operating systems on a single machine.
WITHOUT REBOOTING! Mix Linux / Windows / Novell virtual machines at the
same time. Free trial click here: http://www.vmware.com/wl/offer/345/0
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null





More information about the Users mailing list