check for crash?

Paul Weaver paul.weaver at bbc.co.uk
Fri Nov 21 19:40:58 CET 2008


> I'm wondering if anyone can give me suggestions as to how to 
> monitor a specific process to make sure it keeps running 
> properly. The specific issue I have is that I have a Java 
> Webstart applet running on OpenBSD
> 4.4 that, from time to time, randomly crashes. I can't just 
> use check_process (I don't think so, at least) because when 
> it "crashes"  
> the process never actually exits- it just stops working. I 
> have to go in and manually kill it before I can restart it. 

When monitoring something new, ask:
1) How do you personally detect that it's crashed?
2) Can that be automated?
3) If so, write a plugin.
4) If not, look at the things the applet has an effect on. For example:

Imagine a web browser pointing at a refreshing web page. You can tell the
web browser is up (check_process), but that doesn't tell you it's
actually refreshing the page.

To check that, you could additionally monitor the Apache log on the
web server for the last time the browser's IP address loaded the page.

Something like this, which prints the timestamp of the most recent
request from $IP:
tac /var/log/apache2/access.log | grep -F "$IP" | head -n 1 | awk '{print $4" "$5}'
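To turn that one-liner into something Nagios can alert on, the result has
to become an exit code (0 OK, 1 WARNING, 2 CRITICAL, 3 UNKNOWN). Below is a
rough sketch only, not a tested plugin. The function names and the
threshold arguments are made up for illustration, it assumes the Apache
common/combined log format with the client IP in the first field, and
to_epoch assumes GNU date (the -d option), so it would need adapting on a
web server without it.

```shell
#!/bin/sh
# Hypothetical helper names; nothing here ships with Nagios.

# last_hit LOGFILE IP: print the timestamp of the most recent request
# whose client field is exactly IP (prints nothing if there is none).
last_hit() {
    awk -v ip="$2" '$1 == ip { ts = $4 " " $5 } END { if (ts != "") print ts }' "$1"
}

# to_epoch "[21/Nov/2008:19:40:58 +0100]": convert to seconds since the epoch.
# ASSUMPTION: GNU date; other date implementations parse differently.
to_epoch() {
    # Rewrite as "21 Nov 2008 19:40:58 +0100" so date -d can parse it.
    cleaned=$(printf '%s\n' "$1" | sed -e 's/^\[//' -e 's/\]$//' -e 's|/| |g' -e 's/:/ /')
    date -d "$cleaned" +%s
}

# check_freshness AGE WARN CRIT (all in seconds): print a status line and
# return the matching Nagios exit code.
check_freshness() {
    age=$1 warn=$2 crit=$3
    if [ "$age" -ge "$crit" ]; then
        echo "CRITICAL - last page load ${age}s ago"; return 2
    elif [ "$age" -ge "$warn" ]; then
        echo "WARNING - last page load ${age}s ago"; return 1
    fi
    echo "OK - last page load ${age}s ago"; return 0
}

# check_last_hit LOGFILE IP WARN CRIT: glue the pieces together.
check_last_hit() {
    ts=$(last_hit "$1" "$2")
    if [ -z "$ts" ]; then
        echo "UNKNOWN - no requests from $2 in $1"; return 3
    fi
    hit=$(to_epoch "$ts") || { echo "UNKNOWN - cannot parse $ts"; return 3; }
    now=$(date +%s)
    check_freshness $(( now - hit )) "$3" "$4"
}
```

A wrapper script would end with something like
check_last_hit /var/log/apache2/access.log "$IP" 300 900; exit $?
where the 300/900 second thresholds are placeholders to be tuned to the
page's refresh interval.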

Without knowing what the applet does though, it's difficult to know how
to monitor it. 

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users