check_rsync: (Service check did not exit properly)

Ryan Rawdon ryan at u13.net
Tue Jun 8 11:31:50 CEST 2010


Some additional information -

After poking around some more, I have two other plugins exhibiting the 
same behavior: check_irc.pl and check_mysql_replication.  Again with 
these, running them manually from a shell as the nagios user executes 
and outputs just fine.

Here's what is in a check result file for a check_rsync instance that I 
managed to grab before it was consumed:
### Active Check Result File ###
file_time=1275988933

### Nagios Service Check Result ###
# Time: Tue Jun  8 09:22:13 2010
host_name=vm-mx01.puttynuts.com
service_description=RSync Server
check_type=0
check_options=0
scheduled_check=1
reschedule_check=1
latency=0.226000
start_time=1275988933.227030
finish_time=1275988933.280759
early_timeout=0
exited_ok=1
return_code=2
output=(null)


Running with embedded perl off, I did not see that compilation error 
again for the rsync plugin, so I am leaving it using the external perl 
interpreter for now.


On 06/07/2010 07:39 PM, Ryan Rawdon wrote:
> Hey everyone,
>
> I recently installed Nagios on a new system and migrated all of my
> configs and everything over.  Everything is working fine, except for
> check_rsync, which fails on all hosts with "(Service check did not exit
> properly)."
>
> I have tried disabling the embedded perl interpreter (even though it
> worked fine in the last install), but that didn't appear to do anything
> except change the output to "null".
>
> Running the plugin by hand works 100% fine.
>
> What else should I check?  Here's the output with embedded perl (which
> is one of the things that led me to believe it was an embedded perl issue)
>
> [1275952035.062527] [016.0] [pid=2637] Attempting to run scheduled check
> of service 'RSync Server' on host 'vm-mx01.puttynuts.com': check
> options=0, latency=0.062000
> [1275952035.062585] [016.0] [pid=2637] Checking service 'RSync Server'
> on host 'vm-mx01.puttynuts.com'...
> [1275952035.062635] [2320.2] [pid=2637] Raw Command Input:
> $USER1$/check_rsync -H $HOSTADDRESS$
> [1275952035.062656] [2320.2] [pid=2637] Expanded Command Output:
> $USER1$/check_rsync -H $HOSTADDRESS$
> [1275952035.062782] [016.1] [pid=2637] Check result output will be
> written to '/var/lib/nagios3/spool/checkresults/checkg48PqE' (fd=7)
> [1275952035.062934] [016.1] [pid=2637] ** Using Embedded Perl
> interpreter to run service check...
> [1275952035.063208] [016.0] [pid=2637] Embedded Perl failed to compile
> /usr/lib/nagios/plugins/check_rsync, compile error **ePN failed to
> compile /usr/lib/nagios/plugins/chec$
> BEGIN failed--compilation aborted at (eval 1) line 39." at
> /usr/lib/nagios3/p1.pl line 161.
>    - skipping plugin
>
>
> Here's the output with embedded perl turned up (and I've since cranked
> up the logging level a bit):
>
> [1275953233.045782] [016.0] [pid=25338] Attempting to run scheduled
> check of service 'RSync Server' on host 'vm-mx01.puttynuts.com': check
> options=0, latency=0.045000
> [1275953233.045923] [016.0] [pid=25338] Checking service 'RSync Server'
> on host 'vm-mx01.puttynuts.com'...
> [1275953233.046002] [2320.2] [pid=25338] Raw Command Input:
> $USER1$/check_rsync -H $HOSTADDRESS$
> [1275953233.046013] [2320.2] [pid=25338] Expanded Command Output:
> $USER1$/check_rsync -H $HOSTADDRESS$
> [1275953233.046183] [016.1] [pid=25338] Check result output will be
> written to '/var/lib/nagios3/spool/checkresults/check6XvsV0' (fd=7)
> [1275953233.047328] [016.2] [pid=25338] Service check is executing in
> child process (pid=26948)
> [1275953233.098575] [016.2] [pid=26949] Moving temp check result file
> '/var/lib/nagios3/spool/checkresults/check6XvsV0' to queue file
> '/var/lib/nagios3/spool/checkresults/cpt5vVV'...
>
> and another excerpt:
> [1275953511.122166] [016.1] [pid=25338] Handling check result for
> service 'RSync Server' on host 'jester.u13.net'...
> [1275953511.122173] [016.0] [pid=25338] ** Handling check result for
> service 'RSync Server' on host 'jester.u13.net'...
> [1275953511.122179] [016.1] [pid=25338] HOST: jester.u13.net, SERVICE:
> RSync Server, CHECK TYPE: Active, OPTIONS: 0, SCHEDULED: Yes,
> RESCHEDULE: Yes, EXITED OK: Yes, RETURN CODE: 2, OUTPUT: (null)
> [1275953511.122209] [016.2] [pid=25338] Parsing check output...
> [1275953511.122215] [016.2] [pid=25338] Short Output: (null)
> [1275953511.122221] [016.2] [pid=25338] Long Output:  NULL
> [1275953511.122227] [016.2] [pid=25338] Perf Data:    NULL
> [1275953511.122233] [016.2] [pid=25338] ST: HARD  CA: 3  MA: 3  CS: 2
> LS: 2  LHS: 2
> [1275953511.122240] [016.1] [pid=25338] Service is in a non-OK state!
> [1275953511.122246] [016.1] [pid=25338] Host is currently UP, so we'll
> recheck its state to make sure...
> [1275953511.122252] [016.1] [pid=25338] * Using last known host state: 0
> [1275953511.122261] [016.1] [pid=25338] Current/Max Attempt(s): 3/3
> [1275953511.122267] [016.1] [pid=25338] Service has reached max number
> of rechecks, so we'll handle the error...
> [1275953511.122274] [016.1] [pid=25338] Checking service 'RSync Server'
> on host 'jester.u13.net' for flapping...
> [1275953511.122280] [016.2] [pid=25338] LFT=5.00, HFT=20.00, CPC=0.00,
> PSC=0.00%
> [1275953511.122288] [016.1] [pid=25338] Service is not flapping (0.00%
> state change).
> [1275953511.122294] [016.1] [pid=25338] Checking host 'jester.u13.net'
> for flapping...
> [1275953511.122308] [016.2] [pid=25338] LFT=5.00, HFT=20.00, CPC=0.00,
> PSC=0.00%
> [1275953511.122316] [016.1] [pid=25338] Host is not flapping (0.00%
> state change).
> [1275953511.122360] [016.1] [pid=25338] Rescheduling next check of
> service at Mon Jun  7 23:36:45 2010
> [1275953511.122380] [016.0] [pid=25338] Scheduling a non-forced, active
> check of service 'RSync Server' on host 'jester.u13.net' @ Mon Jun  7
> 23:36:45 2010
> [1275953511.122389] [016.2] [pid=25338] Scheduling new service check event.
> [1275953511.122409] [016.1] [pid=25338] Deleted check result file
> '/var/lib/nagios3/spool/checkresults/cwgRtGi'
> [1275953511.122416] [016.2] [pid=25338] Found a check result (#4) to
> handle...
>
>
> ... and it is when it is running like this with embedded perl off which
> results in "(null)" (instead of "(Service check did not exit
> properly)")on my services summary page as well a critical state.
>
> I look forward to hearing what suggestions you might have
>
> Ryan
>
>
>
> ------------------------------------------------------------------------------
> ThinkGeek and WIRED's GeekDad team up for the Ultimate
> GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the
> lucky parental unit.  See the prize list and enter to win:
> http://p.sf.net/sfu/thinkgeek-promo
> _______________________________________________
> Nagios-users mailing list
> Nagios-users at lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/nagios-users
> ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue.
> ::: Messages without supporting info will risk being sent to /dev/null
>    

------------------------------------------------------------------------------
ThinkGeek and WIRED's GeekDad team up for the Ultimate 
GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the 
lucky parental unit.  See the prize list and enter to win: 
http://p.sf.net/sfu/thinkgeek-promo
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null





More information about the Users mailing list