A different way?

hemebond hemebond at gmail.com
Fri Sep 25 10:19:42 CEST 2009


Isn't this the same as using passive checks? It sounds like what I've set
up. I wrote a simple agent (script) that has its own schedule and runs the
checks, sending the result back to a Nagios server.

2009/9/25 Steven D. Morrey <smorrey at ldschurch.org>

> Hello everyone,
>
> I've decided to take a break for a bit from multi-threading nagios to focus
> on DNX since that is my day job after all :)
> While working on all of this I had a few thoughts that might make some good
> ideas if Nagios is ever re-designed again, say for a 4.x branch.
>
> As you know, under nagios, all checks are dispatched by nagios to be
> executed on the local machine at set intervals.
> Under a distributed nagios setup, you have multiple nagios instances
> running on various machines executing checks and passing the results back to
> a passive master controller.
>
> Under DNX, we distribute the load to "worker nodes" which then execute the
> checks and hand the results back to an active master controller that then
> processes the result etc.
>
> Profiling shows that (under DNX at least) 2/3rds of our time is spent in
> the reaper processing results, so  wouldn't it make more sense to flip the
> process around?
>
> The checks are already executing on the local machine, so how about a
> daemon on each machine, the daemon would keep the schedule and execute
> service checks locally, processing the result and returning the results and
> the required actions (based on a local policy) to nagios which would then do
> the actual work of handling notifications etc and so forth.
> This way nagios could be an auditor, if it doesn't receive a result on time
> as expected, then it could query the daemon to see whats gone wrong, if that
> fails then it could initiate a host check, etc.
>
> >From a design standpoint this is a bit more work than the current setup,
> but it seems to me that this could allow for much greater flexibility and
> scalability in the long run.
>
> Anyways I hope this sparks a little debate but I don't want to "come in and
> shake things up", or go around changing everything, stepping on toes all the
> while, it's just that putting the responsibility of actually executing the
> check and doing so on time, onto the computer it needs to execute on, just
> makes more sense to me.
> It's not really dramatically different from what we do now, it's just
> adding a scheduler/timer to the existing execution framework and adding
> something to push the original schedule and any changes such as scheduled
> downtime to the appropriate machines, putting everything else into a semi
> passive mode effectively turning each machine to be checked into it's own
> "worker node"
>
> Thoughts?
>
> Sincerely,
> Steve
>
>
>
>
>  NOTICE: This email message is for the sole use of the intended
> recipient(s) and may contain confidential and privileged information. Any
> unauthorized review, use, disclosure or distribution is prohibited. If you
> are not the intended recipient, please contact the sender by reply email and
> destroy all copies of the original message.
>
>
>
>
> ------------------------------------------------------------------------------
> Come build with us! The BlackBerry® Developer Conference in SF, CA
> is the only developer event you need to attend this year. Jumpstart your
> developing skills, take BlackBerry mobile applications to market and stay
> ahead of the curve. Join us from November 9-12, 2009. Register now!
> http://p.sf.net/sfu/devconf
> _______________________________________________
> Nagios-devel mailing list
> Nagios-devel at lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/nagios-devel
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://www.monitoring-lists.org/archive/developers/attachments/20090925/c0d92ed7/attachment.html>
-------------- next part --------------
------------------------------------------------------------------------------
Come build with us! The BlackBerry® Developer Conference in SF, CA
is the only developer event you need to attend this year. Jumpstart your
developing skills, take BlackBerry mobile applications to market and stay 
ahead of the curve. Join us from November 9-12, 2009. Register now!
http://p.sf.net/sfu/devconf
-------------- next part --------------
_______________________________________________
Nagios-devel mailing list
Nagios-devel at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-devel


More information about the Developers mailing list