Fault tolerant OCSPs

Jason Martin jhmartin at toger.us
Thu Sep 2 01:08:18 CEST 2004


Has anyone implemented an OCSP that is fault tolerant? The
failure scenario I am envisioning arises in a two-tier
distributed model. The distributed server detects a volatile
alert, say a logfile alert indicating that a disk has failed. It
then calls the OCSP for that alert to report it to the central
server, but a transient network failure causes send_nsca to fail.

send_nsca has no way of queueing the alert so that it can be
sent up to the central server at a later point, and the nature
of the alert is not one that will necessarily repeat. The alert
gets lost, no notifications are sent from the central server,
and the machine eventually fails due to another disk failure,
since it isn't configured to handle a 2-way disk failure.

Is there a simple way to maintain the distributed Nagios setup
and also cover volatile alerts reliably?
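
To make the idea concrete, here is a rough sketch of the kind of
queueing I have in mind, written in Python. It is not an existing
Nagios or NSCA feature. The OCSP would call spool_result() to write
each result to a spool directory, and a separate job (cron, say)
would call flush_spool() to push the backlog through send_nsca,
deleting a file only after a successful send. The spool path, the
central host name, the config path, and all function names are
assumptions made up for illustration.

```python
import os
import subprocess
import tempfile
import time
import uuid

# Hypothetical spool location; adjust to the local layout.
SPOOL_DIR = "/var/spool/nagios/ocsp"


def spool_result(spool_dir, host, service, return_code, output):
    """Write one service check result to its own spool file."""
    os.makedirs(spool_dir, exist_ok=True)
    # Tab-separated line in the form send_nsca reads on stdin.
    line = f"{host}\t{service}\t{return_code}\t{output}\n"
    # Write to a temp file first, then rename, so the flusher
    # never sees a half-written result.
    fd, tmp_path = tempfile.mkstemp(dir=spool_dir, prefix=".tmp-")
    with os.fdopen(fd, "w") as handle:
        handle.write(line)
    # Timestamp prefix keeps the files roughly in arrival order.
    name = f"{time.time_ns():020d}-{uuid.uuid4().hex}"
    final_path = os.path.join(spool_dir, name)
    os.rename(tmp_path, final_path)
    return final_path


def send_with_nsca(line,
                   central_host="central.example.com",     # assumed
                   config="/etc/nagios/send_nsca.cfg"):    # assumed
    """Pipe one result line to send_nsca; True on exit status 0."""
    try:
        proc = subprocess.run(
            ["send_nsca", "-H", central_host, "-c", config],
            input=line, text=True, capture_output=True,
        )
    except OSError:
        # send_nsca is missing or could not be started.
        return False
    return proc.returncode == 0


def flush_spool(spool_dir, send=send_with_nsca):
    """Try to send every spooled result; return how many were sent."""
    sent = 0
    for name in sorted(os.listdir(spool_dir)):
        if name.startswith(".tmp-"):
            continue  # still being written
        path = os.path.join(spool_dir, name)
        with open(path) as handle:
            line = handle.read()
        if not send(line):
            break  # link is probably down; keep the rest for later
        os.remove(path)
        sent += 1
    return sent
```

With something like this, a transient network failure would only
delay the volatile alert until the next flush instead of losing it.
A result could still be delivered twice if the machine dies between
the send and the delete, which seems acceptable for alerts.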

Thanks,
-Jason Martin
-- 
Do NOT look into laser with remaining eyeball!
This message is PGP/MIME signed.