nagios core dump and restart when check_nrpe

Kelvin Xu kyoxu at hotmail.com
Fri Jan 4 05:00:34 CET 2008



Hi all,
 
I have just installed Nagios 3.0rc1 into a Solaris 10 machine. Everything is working fine except when i tried to do a check_nrpe on a remote host or localhost. I check my /var/adm/messages. Below is a section of the output:
 

Jan  4 10:16:39 pnsgsit1gw1 nagios[263]: [ID 702911 user.info] Caught SIGTERM, shutting down...Jan  4 10:16:39 pnsgsit1gw1 nagios[263]: [ID 702911 user.info] Successfully shutdown... (PID=263)Jan  4 10:16:39 pnsgsit1gw1 nagios[290]: [ID 702911 user.info] Nagios 3.0rc1 starting... (PID=290)Jan  4 10:16:39 pnsgsit1gw1 nagios[290]: [ID 702911 user.info] Local time is Fri Jan 04 10:16:39 SGT 2008Jan  4 10:16:39 pnsgsit1gw1 nagios[290]: [ID 702911 user.info] LOG VERSION: 2.0Jan  4 10:16:39 pnsgsit1gw1 nagios[291]: [ID 702911 user.info] Finished daemonizing... (New PID=291)Jan  4 10:17:53 pnsgsit1gw1 genunix: [ID 603404 kern.notice] NOTICE: core_log: nagios[302] setid process, core not dumped: /var/core/core.nagios.302.pnsgsit1gw1.210033.65541.1199413073Jan  4 10:17:53 pnsgsit1gw1 nagios[291]: [ID 702911 user.info] Caught SIGTERM, shutting down...Jan  4 10:17:53 pnsgsit1gw1 nagios[291]: [ID 702911 user.info] Successfully shutdown... (PID=291)Jan  4 10:17:53 pnsgsit1gw1 nagios[305]: [ID 702911 user.info] Nagios 3.0rc1 starting... (PID=305)Jan  4 10:17:53 pnsgsit1gw1 nagios[305]: [ID 702911 user.info] Local time is Fri Jan 04 10:17:53 SGT 2008Jan  4 10:17:53 pnsgsit1gw1 nagios[305]: [ID 702911 user.info] LOG VERSION: 2.0Jan  4 10:17:53 pnsgsit1gw1 nagios[306]: [ID 702911 user.info] Finished daemonizing... (New PID=306)
This will repeat every few minutes and will not occur when i remove the nrpe service monitoring from the configuration. 
I tried to do a /usr/local/nagios/libexec/check_nrpe -H pnsgsit1gw2 -c check_load, The output seems fine except there is some addition characters appended to end. 
OK - load average: 0.00, 0.00, 0.00|load1=0.000;15.000;30.000;0; load5=0.000;10.000;25.000;0; load15=0.000;5.000;20.000;0;ÿ¿àpÿ:
Below is my host configuration:
define host{        use     linux-server        host_name pnsgsit1gw2 ; The name we're giving to this server        alias pnsgsit1gw2 ; A longer name for the server        address 10.106.127.106 ; IP address of the server}
define service{        use                             local-service        host_name                       pnsgsit1gw2        service_description             NRPE        check_command                   check_nrpe!check_load}
define service{        use                             local-service         ; Name of service template to use        host_name                       pnsgsit1gw2        service_description             HTTP        check_command                   check_http        notifications_enabled           0        }
 
 
Anyone has any ideas of what could be the problem?
 
Thanks
 
Regards,
Kelvin Xu
 
_________________________________________________________________
Get your free suite of Windows Live services today!
http://www.get.live.com/wl/all
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://www.monitoring-lists.org/archive/users/attachments/20080104/710d6a2c/attachment.html>
-------------- next part --------------
-------------------------------------------------------------------------
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2005.
http://clk.atdmt.com/MRT/go/vse0120000070mrt/direct/01/
-------------- next part --------------
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null


More information about the Users mailing list