check_rsync: (Service check did not exit properly)

Ryan Rawdon ryan at u13.net
Tue Jun 8 01:39:44 CEST 2010


Hey everyone,

I recently installed Nagios on a new system and migrated all of my 
configs and everything over.  Everything is working fine, except for 
check_rsync, which fails on all hosts with "(Service check did not exit 
properly)."

I have tried disabling the embedded perl interpreter (even though it 
worked fine in the last install), but that didn't appear to do anything 
except change the output to "null".

Running the plugin by hand works 100% fine.

What else should I check?  Here's the output with embedded perl (which 
is one of the things that led me to believe it was an embedded perl issue)

[1275952035.062527] [016.0] [pid=2637] Attempting to run scheduled check 
of service 'RSync Server' on host 'vm-mx01.puttynuts.com': check 
options=0, latency=0.062000
[1275952035.062585] [016.0] [pid=2637] Checking service 'RSync Server' 
on host 'vm-mx01.puttynuts.com'...
[1275952035.062635] [2320.2] [pid=2637] Raw Command Input: 
$USER1$/check_rsync -H $HOSTADDRESS$
[1275952035.062656] [2320.2] [pid=2637] Expanded Command Output: 
$USER1$/check_rsync -H $HOSTADDRESS$
[1275952035.062782] [016.1] [pid=2637] Check result output will be 
written to '/var/lib/nagios3/spool/checkresults/checkg48PqE' (fd=7)
[1275952035.062934] [016.1] [pid=2637] ** Using Embedded Perl 
interpreter to run service check...
[1275952035.063208] [016.0] [pid=2637] Embedded Perl failed to compile 
/usr/lib/nagios/plugins/check_rsync, compile error **ePN failed to 
compile /usr/lib/nagios/plugins/chec$
BEGIN failed--compilation aborted at (eval 1) line 39." at 
/usr/lib/nagios3/p1.pl line 161.
  - skipping plugin


Here's the output with embedded perl turned up (and I've since cranked 
up the logging level a bit):

[1275953233.045782] [016.0] [pid=25338] Attempting to run scheduled 
check of service 'RSync Server' on host 'vm-mx01.puttynuts.com': check 
options=0, latency=0.045000
[1275953233.045923] [016.0] [pid=25338] Checking service 'RSync Server' 
on host 'vm-mx01.puttynuts.com'...
[1275953233.046002] [2320.2] [pid=25338] Raw Command Input: 
$USER1$/check_rsync -H $HOSTADDRESS$
[1275953233.046013] [2320.2] [pid=25338] Expanded Command Output: 
$USER1$/check_rsync -H $HOSTADDRESS$
[1275953233.046183] [016.1] [pid=25338] Check result output will be 
written to '/var/lib/nagios3/spool/checkresults/check6XvsV0' (fd=7)
[1275953233.047328] [016.2] [pid=25338] Service check is executing in 
child process (pid=26948)
[1275953233.098575] [016.2] [pid=26949] Moving temp check result file 
'/var/lib/nagios3/spool/checkresults/check6XvsV0' to queue file 
'/var/lib/nagios3/spool/checkresults/cpt5vVV'...

and another excerpt:
[1275953511.122166] [016.1] [pid=25338] Handling check result for 
service 'RSync Server' on host 'jester.u13.net'...
[1275953511.122173] [016.0] [pid=25338] ** Handling check result for 
service 'RSync Server' on host 'jester.u13.net'...
[1275953511.122179] [016.1] [pid=25338] HOST: jester.u13.net, SERVICE: 
RSync Server, CHECK TYPE: Active, OPTIONS: 0, SCHEDULED: Yes, 
RESCHEDULE: Yes, EXITED OK: Yes, RETURN CODE: 2, OUTPUT: (null)
[1275953511.122209] [016.2] [pid=25338] Parsing check output...
[1275953511.122215] [016.2] [pid=25338] Short Output: (null)
[1275953511.122221] [016.2] [pid=25338] Long Output:  NULL
[1275953511.122227] [016.2] [pid=25338] Perf Data:    NULL
[1275953511.122233] [016.2] [pid=25338] ST: HARD  CA: 3  MA: 3  CS: 2  
LS: 2  LHS: 2
[1275953511.122240] [016.1] [pid=25338] Service is in a non-OK state!
[1275953511.122246] [016.1] [pid=25338] Host is currently UP, so we'll 
recheck its state to make sure...
[1275953511.122252] [016.1] [pid=25338] * Using last known host state: 0
[1275953511.122261] [016.1] [pid=25338] Current/Max Attempt(s): 3/3
[1275953511.122267] [016.1] [pid=25338] Service has reached max number 
of rechecks, so we'll handle the error...
[1275953511.122274] [016.1] [pid=25338] Checking service 'RSync Server' 
on host 'jester.u13.net' for flapping...
[1275953511.122280] [016.2] [pid=25338] LFT=5.00, HFT=20.00, CPC=0.00, 
PSC=0.00%
[1275953511.122288] [016.1] [pid=25338] Service is not flapping (0.00% 
state change).
[1275953511.122294] [016.1] [pid=25338] Checking host 'jester.u13.net' 
for flapping...
[1275953511.122308] [016.2] [pid=25338] LFT=5.00, HFT=20.00, CPC=0.00, 
PSC=0.00%
[1275953511.122316] [016.1] [pid=25338] Host is not flapping (0.00% 
state change).
[1275953511.122360] [016.1] [pid=25338] Rescheduling next check of 
service at Mon Jun  7 23:36:45 2010
[1275953511.122380] [016.0] [pid=25338] Scheduling a non-forced, active 
check of service 'RSync Server' on host 'jester.u13.net' @ Mon Jun  7 
23:36:45 2010
[1275953511.122389] [016.2] [pid=25338] Scheduling new service check event.
[1275953511.122409] [016.1] [pid=25338] Deleted check result file 
'/var/lib/nagios3/spool/checkresults/cwgRtGi'
[1275953511.122416] [016.2] [pid=25338] Found a check result (#4) to 
handle...


... and it is when it is running like this with embedded perl off which 
results in "(null)" (instead of "(Service check did not exit 
properly)")on my services summary page as well a critical state.

I look forward to hearing what suggestions you might have

Ryan



------------------------------------------------------------------------------
ThinkGeek and WIRED's GeekDad team up for the Ultimate 
GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the 
lucky parental unit.  See the prize list and enter to win: 
http://p.sf.net/sfu/thinkgeek-promo
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null





More information about the Users mailing list